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Foreword 



It is a distinct pleasure for me to write a foreword to this new textbook by my 
long-time friend, Professor Milt Ohring. 

There have been at least 200 books written on various aspects of thin film 
science and technology, but this is the first true textbook, specifically intended 
for classroom use in universities. In my opinion there has been a crying need 
for a real textbook for a long time. Most thin film courses in universities have 
had to use many books written for relatively experienced thin film scientists 
and engineers, often supplemented by notes prepared by the course instructor. 
The Materials Science of Thin Films , a true textbook, complete with 
problems after each chapter, is available to serve as a nucleus for first courses 
in thin film science and technology. 

In addition to his many years of experience teaching and advising graduate 
students at Stevens Institute of Technology, Professor Ohring has been the 
coordinator of an on-premises, M.S. degree program offered by Stevens at the 
AT&T Bell Laboratories in Murray Hill and Whippany, New Jersey. This 
ongoing cooperative program has produced over sixty M.S. graduates to date. 
Several of these graduates have gone on to acquire Ph.D. degrees. The 
combination of teaching, research, and industrial involvement has provided 
Professor Ohring with a broad perspective of thin film science and technology 
and tremendous insight into the needs of students entering this exciting field. 
His insight and experience are quite evident in this textbook. 


John L. Vossen 





Preface 



Thin-film science and technology play a crucial role in the high-tech industries 
that will bear the main burden of future American competitiveness. While the 
major exploitation of thin films has been in microelectronics, there are 
numerous and growing applications in communications, optical electronics, 
coatings of all kinds, and in energy generation and conservation strategies. A 
great many sophisticated analytical instruments and techniques, largely devel¬ 
oped to characterize thin films and surfaces, have already become indispens¬ 
able in virtually every scientific endeavor irrespective of discipline. When I 
was called upon to offer a course on thin films, it became a genuine source of 
concern to me that there were no suitable textbooks available on this unques¬ 
tionably important topic. This book, written with a materials science flavor, is 
a response to this need. It is intended for 

1. Science and engineering students in advanced undergraduate or first-year 
graduate level courses on thin films 

2. Participants in industrial in-house courses or short courses offered by 
professional societies 

3. Mature scientists and engineers switching career directions who require an 
overview of the field. 

Readers should be reasonably conversant with introductory college chem¬ 
istry and physics and possess a passive cultural familiarity with topics com¬ 
monly treated in undergraduate physical chemistry and modem physics courses. 


xiii 
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Preface 


It is worthwhile to briefly elaborate on this book’s title and the connection 
between thin films and the broader discipline of materials science and engineer¬ 
ing. A dramatic increase in our understanding of the fundamental nature of 
materials throughout much of the twentieth century has led to the development 
of materials science and engineering. This period witnessed the emergence of 
polymeric, nuclear, and electronic materials, new roles for metals and ceram¬ 
ics, and the development of reliable methods to process these materials in bulk 
and thin-film form. Traditional educational approaches to the study of materi¬ 
als have stressed structure-property relationships in bulk solids, typically 
utilizing metals, semiconductors, ceramics; and polymers, taken singly or 
collectively as illustrative vehicles to convey principles. The same spirit is 
adopted in this book except that thin solid films are the vehicle. In addition, 
the basic theme has been expanded to include the multifaceted processing- 
structure-properties-performance interactions. Thus the original science 
core is preserved but enveloped by the engineering concerns of processing 
and performance. Within this context, I have attempted to weave threads of 
commonality among seemingly different materials and properties, as well as to 
draw distinctions between materials that exhibit outwardly similiar behavior. In 
particular, parallels and contrasts between films and bulk materials are recur¬ 
ring themes. 

An optional introductory review chapter on standard topics in materials 
establishes a foundation for subsequent chapters. Following a second chapter 
on vacuum science and technology, the remaining text is broadly organized 
into three categories. Chapters 3 and 4 deal with the principles and practices of 
film deposition from the vapor phase. Chapters 5-9 deal with the processes 
and phenomena that influence the structural, chemical, and physical attributes 
of films, and how to characterize them. Topics discussed include nucleation, 
growth, crystal perfection, epitaxy, mass transport effects, and the role of 
stress. These are the common thin-film concerns irrespective of application. 
The final portion of the book (Chapters 10-14) is largely devoted to specific 
film properties (electrical, magnetic, optical, mechanical) and applications, as 
well as to emerging materials and processes. Although the first nine chapters 
may be viewed as core subject matter, the last five chapters offer elective 
topics intended to address individual interests. It is my hope that instructors 
using this book will find this division of topics a useful one. 

Much of the book reflects what is of current interest to the thin-film research 
and development communities. Examples include chapters on chemical vapor 
deposition, epitaxy, interdiffusion and reactions, metallurgical and protective 
coatings, and surface modification. The field is evolving so rapidly that even 
the classics of yesteryear, e.g., Maissel and Glang, Handbook of Thin Film 
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xv 


Technology (1970) and Chopra, Thin Film Phenomena (1969), as well as 
more recent books on thin films, e.g., Pulker, Coatings on Glass (1984), and 
Eckertova, Physics of Thin Films (1986), make little or no mention of these 
now important subjects. 

As every book must necessarily establish its boundaries, I would like to 
point out the following: (1) Except for coatings (Chapter 12) where thicknesses 
range from several to as much as hundreds of microns (1 micron or 1 
= 10“ 6 meter), the book is primarily concerned with films that are less 
than 1 j*m thick. (2) Only films and coatings formed from the gas phase by 
physical (PVD) or chemical vapor deposition (CVD) processes are considered. 
Therefore spin and dip coating, flame and plasma spraying of powders, 
electrolytic deposition, etc., will not be treated. (3) The topic of polymer films 
could easily justify a monograph of its own, and hence will not be discussed 
here. (4) Time and space simply do not allow for development of all topics 
from first principles. (Nevertheless, I have avoided using the unwelcome 
phrase “It can be shown that . . . and have refrained from using other 
textbooks or the research literature to fill in missing steps of derivations.) (5) A 
single set of units (e.g., CGS, MKS, SI, etc.) has been purposely avoided to 
better address the needs of a multifaceted and interdisciplinary audience. 
Common usage, commercial terminology, the research literature and simple 
bias and convenience have all played a role in the ecumenical display of units. 
Where necessary, conversions between different systems of units are provided. 

At the end of each chapter are problems of varying difficulty, and I believe a 
deeper sense of the subject matter will be gained by considering them. Three 
very elegant problems (i.e. 9-6, -7, -8) were developed by Professor W. D. 
Nix, and I thank him for their use. 

By emphasizing immutable concepts, 1 hope this book will be spared the 
specter of rapid obsolescence. However, if this book will in some small 
measure help spawn new technology rendering it obsolete, it will have served a 
useful function. 


Milton Ohring 





Acknowledgments 



At the top of my list of acknowledgments I would like to thank John Vossen 
for his advice and steadfast encouragement over a number of years. This book 
would not have been possible without the wonderfully extensive intellectual 
and physical resources of AT&T Bell Laboratories, Murray Hill, NJ, and the 
careful execution of the text and figures at Stevens Institute. In particular Bell 
Labs library was indispensable and I am indebted to AT & T for allowing me 
to use it. My long association with Bell Labs is largely due to my dear friend 
L. C. Kimerling (Kim), and I thank him and AT&T for supporting my efforts 
there. I am grateful to the many Bell Labs colleagues and students in the 
Stevens Institute of Technology/Bell Labs “On Premises Approved Program 
(OPAP),” who planted the seed for a textbook on thin films. In this regard 
D. C. Jacobson should be singled out for his continuous help with many 
aspects of this work. The following Bell Labs staff members contributed to 
this book through helpful comments and discussions, and by contributing fig¬ 
ures, problems, and research papers: J. C. Bean, J. L. Benton, W. L. Brown, 
F. Capasso, G. K. Celler, A. Y. Cho, J. M. Gibson, H. J. Gossmann, R. Hull, 
R. W. Knoell, R. F. Kopf, Y. Kuk, H. S. Luftman, S. Nakahara, M. B. Panish, 
J. M. Poate, S. M. Sze, K. L. Tai, W. W. Tai, H. Temkin, L. F. Thompson, 
L. E. Trimble, M. J. Vasile, and R. Wolfe. I appreciate their time and effort 
spent on my behalf. 

Some very special people at Stevens enabled the book to reach fruition. 
They include Pat Downes for expertly typing a few versions of the complete 


xvii 



xviii 


Acknowledgments 


text during evenings that she could have spent more pleasantly; Eleanor 
Gehler, for kindly undertaking much additional typing; Kamlesh Patel for his 
professional computerized drafting of the bulk of the figures; Chris Rywalt, 
Manoj Thomas, and Tao Jen for carefully rendering the remainder of the 
figures; Mehboob Alam and Warren Moberly for their computer help in 
compiling the index, and drafting the cover, respectively; Dick Widdicombe, 
Bob Ehrlich, Dan Schwarcz, Lauren Snyder, and Noemia Carvalho for many 
favors; Professor R. Weil for helpful comments; Profs. W. Carr, H. Salwen, 
and T. Hart for their expert and generous assistance on several occasions; 
Professor B. Gallois and G. M. Rothberg for support; and those at Stevens 
responsible for granting my sabbatical leave in 1988. My sincere thanks to all 
of you. 

Lastly, I am grateful to several anonymous reviewers for many pertinent 
comments and for uncovering textual errors. They are absolved of all responsi¬ 
bility for any shortcomings that remain. 

This book is lovingly dedicated to Ahrona, Avi, Noam, and Feigel, who in 
varying degrees had to contend with a less that a full-time husband and father 
for too many years. 



Thin Films — A 
Historical Perspective 


Thin-film technology is simultaneously one of the oldest arts and one of the 
newest sciences. Involvement with thin films dates to the metal ages of 
antiquity. Consider the ancient craft of gold beating, which has been practiced 
continuously for at least four millenia. Gold’s great malleability enables it to be 
hammered into leaf of extraordinary thinness while its beauty and resistance to 
chemical degradation have earmarked its use for durable ornamentation and 
protection purposes. The Egyptians appear to have been the earliest practition¬ 
ers of the art of gold beating and gilding. Many magnificent examples of 
statuary, royal crowns, and coffin cases which have survived intact attest to the 
level of skill achieved. The process involves initial mechanical rolling followed 
by many stages of beating and sectioning composite structures consisting of 
gold sandwiched between layers of vellum, parchment, and assorted animal 
skins. Leaf samples from Luxor dating to the Eighteenth Dynasty (1567-1320 
B.C.) measured 0.3 microns in thickness. As a frame of reference for the 
reader, the human hair is about 75 microns in diameter. Such leaf was 
carefully applied and bonded to smoothed wax or resin-coated wood surfaces 
in a mechanical (cold) gilding process. From Egypt the art spread as indicated 
by numerous accounts of the use of gold leaf in antiquity. 

Today, gold leaf can be machine-beaten to 0.1 micron and to 0.05 micron 
when beaten by a skilled craftsman. In this form it is invisible sideways and 
quite readily absorbed by the skin. It is no wonder then that British gold 
beaters were called upon to provide the first metal specimens to be observed 
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in the transmission electron microscope. Presently, gold leaf is used to deco¬ 
rate such diverse structures and objects as statues, churches, public buildings, 
tombstones, furniture, hand-tooled leather, picture frames and, of course, 
illuminated manuscripts. 

Thin-film technologies related to gold beating, but probably not as old, are 
mercury and fire gilding. Used to decorate copper or bronze statuary, the cold 
mercury process involved carefully smoothing and polishing the metal surface, 
after which mercury was rubbed into it. Some copper dissolved in the 
mercury, forming a very thin amalgam film that left the surface shiny and 
smooth as a mirror. Gold leaf was then pressed onto the surface cold and 
bonded to the mercury-rich adhesive. Alternately, gold was directly amalga¬ 
mated with mercury, applied, and the excess mercury was then driven off by 
heating, leaving a film of gold behind. Fire gilding was practiced well into the 
nineteenth century despite the grave health risk due to mercury vapor. The 
hazard to workers finally became intolerable and provided the incentive to 
develop alternative processes, such as electroplating. 

The history of gold beating and gilding is replete with experimentation and 
process development in diverse parts of the ancient world. Practitioners were 
concerned with the purity and cost of the gold, surface preparation, the 
uniformity of the applied films, adhesion to the substrate, reactions between 
and among the gold, mercury, copper, bronze (copper-tin), etc., process 
safety, color, optical appearance, durability of the final coating, and competi¬ 
tive coating technologies. As we shall see in the ensuing pages, modem 
thin-film technology addresses these same generic issues, albeit with a great 
compression of time. And although science is now in the ascendancy, there is 
still much room for art. 
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A Review of 
Materials Science 


1.1. Introduction 


A cursory consideration of the vast body of solid substances reveals what 
outwardly appears to be an endless multitude of external forms and structures 
possessing a bewildering variety of properties. The branch of study known as 
materials science and engineering evolved in part to classify those features that 
are common among the structure and properties of different materials in a 
manner somewhat reminiscent of chemical or biological classification schemes. 
This dramatically reduces the apparent variety. From this perspective, it turns 
out that solids can be classified as typically belonging to one of only four 
categories (metallic, ionic, covalent, or van der Waals), depending on the 
nature of the electronic structure and resulting interatomic bonding forces. 

Similar divisions occur with respect to the structure of solids. Solids are 
either internally crystalline or noncrystalline. Those that are crystalline can be 
further subdivided according to one of 14 different geometric arrays or lattices, 
depending on the placement of the atoms. When properties are considered, 
there are similar simplifying categorizations. Thus, materials are either good, 
intermediate, or poor conductors of electricity, and they are either mechani¬ 
cally brittle or can easily be stretched without fracture, and they are either 
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optically reflective or transparent, etc. It is, of course, easier to recognize that 
property differences exist than to understand why they exist. Nevertheless, 
much progress has been made in this subject as a result of the research of the 
past 50 years. Basically, the richness in the diversity of materials properties 
occurs because countless combinations of the admixture of chemical composi¬ 
tions, bonding types, crystal structures, and morphologies are available natu¬ 
rally or can be synthesized. 

In this chapter various aspects of structure and bonding in solids are 
reviewed for the purpose of providing the background to better understand the 
remainder of the book. In addition, several topics dealing with thermodynam¬ 
ics and kinetics of atomic motion in materials are also included. These will 
later have relevance to aspects of the stability, formation, and solid-state 
reactions in thin films. Much of this chapter is a condensed adaptation of 
standard treatments of bulk materials, but it is equally applicable to thin films. 
Nevertheless, many distinctions between bulk materials and films exist, and 
they will be stressed where possible. Readers already familiar with concepts of 
materials science may wish to skip this chapter; those who seek deeper and 
broader coverage should consult the bibliography for recommended texts on 
this subject. 


1.2. Structure 


1.2.1. Crystalline Solids 

Many solid materials possess an ordered internal crystal structure despite 
external appearances that are not what we associate with the term crystal¬ 
line— i.e., clear, transparent, faceted, etc. Actual crystal structures can be 
imagined to arise from a three-dimensional array of points geometrically and 
repetitively distributed in space such that each point has identical surroundings. 
There are only 14 ways to arrange points in space having this property, and the 
resulting point arrays are known as Bravais lattices. They are shown in Fig. 
1-1 with lines intentionally drawn in to emphasize the symmetry of the lattice. 
Only a single cell for each lattice is reproduced here, and the point array 
actually stretches in an endlessly repetitive fashion in all directions. If an atom 
or group of two or more atoms is now placed at each Bravais lattice point, a 
physically real crystal structure emerges. Thus, if individual copper atoms 
populated every point of a face-centered cubic (FCC) lattice whose cube edge 
dimension, or so-called lattice parameter, were 3.615 A, the material known as 
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TRICLINIC MONOCLINIC, MONOCUNIC, 

(a 4 f * y * 90, a ♦ b 4 c) SIMPLE BASE CENTERED 



ORTHORHOMBIC 



HEXAGONAL RHOMBOHEDRAL TETRAGONAL 
Figure 1-1. The 14 Bravais space lattices. 


metallic copper would be generated; and similarly for other types of lattices 
and atoms. 

The reader should realize that just as there are no lines in actual crystals, 
there are no spheres. Each sphere in the Cu crystal structure represents the 
atomic nucleus surrounded by a complement of 28 core electrons [i.e., (Is) 2 
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(2s) 2 (2p) 6 (3s) 2 (3p) 6 (3d) 10 ] and a portion of the free-electron gas contributed 
by 4s electrons. Furthermore, these spheres must be imagined to touch in 
certain crystallographic directions, and their packing is rather dense. In FCC 
structures the atom spheres touch along the direction of the face diagonals, 
i.e., [110], but not along the face edge directions, i.e., [100]. This means that 
the planes containing the three face diagonals shown in Fig. l-2a, i.e., the 
(111) plane, are close-packed. On this plane the atoms touch each other in 
much the same way as a racked set of billiard balls on a pool table. All other 
planes in the FCC structure are less densely packed and thus contain fewer 
atoms per unit area. 

Placement of two identical silicon atoms at each FCC point would result in 
the formation of the diamond cubic silicon structure (Fig. l-2c), whereas the 
rock-salt structure (Fig. l-2b) is generated if sodium-chlorine groups were 
substituted for each lattice point. In both cases the positions and orientation of 
the two atoms in question must be preserved from point to point. 

In order to quantitatively identify atomic positions as well as planes and 
directions in crystals, simple concepts of coordinate geometry are utilized. 
First, orthogonal axes are arbitrarily positioned with respect to a cubic lattice 
(e.g., FCC) such that each point can now be identified by three coordinates 


a. b. 




Figure 1-2. (a) (111) plane in FCC lattice; (b) rock-salt structure, e.g., NaCl; Na • , 
Cl •; (c) diamond cubic structure, e.g.. Si, Ge; (d) zinc blende structure, e.g., GaAs. 
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c, d. 



Figure 1-3. (a) coordinates of lattice sites; (b) Miller indices of planes; (c,d) Miller 

indices of planes and directions. 

(Fig. l-3a). If the center of the coordinate axes is taken as x = 0, y = 0, 
z = 0, or (0,0,0), then the coordinates of other nearest equivalent cube comer 
points are (1,0,0) (0, 1,0) (1,0,0), etc. In this framework the two Si atoms 
referred to earlier, situated at the center of the coordinate axes, would occupy 
the (0,0,0) and (1/4,1/4, 1/4) positions. Subsequent repetitions of this 
oriented pair of atoms at each FCC lattice point generate the diamond cubic 
structure in which each Si atom has four nearest neighbors arranged in a 
tetrahedral configuration. Similarly, substitution of the motif (0,0,0) Ga and 
(1/4, 1/4, 1/4) As for each point of the FCC lattice would result in the zinc 
blende GaAs crystal structure (Fig. l-2d). 

Specific crystal planes and directions are frequently noteworthy because 
phenomena such as crystal growth, chemical reactivity, defect incorporation, 
deformation, and assorted properties are not isotropic or the same on all planes 
and in all directions. Therefore, it is important to be able to identify accurately 
and distinguish crystallographic planes and directions. A simple recipe for 
identifying a given plane in the cubic system is the following: 

1. Determine the intercepts of the plane on the three crystal axes in number of 
unit cell dimensions. 

2. Take reciprocals of those numbers. 

3. Reduce these reciprocals to smallest integers by clearing fractions. 
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The result is a triad of numbers known as the Miller indices for the plane in 
question, i.e., ( h,k,I ). Several planes with identifying Miller indices are 
indicated in Fig. 1-3. Note that a negative index is indicated above the integer 
with a minus sign. 

Crystallographic directions shown in Fig. 1-3 are determined by the compo¬ 
nents of the vector connecting any two lattice points lying along the direction. 
If the coordinates of these points are u ] , , Wj and u 2 , v 2 , w 2 , then the 

components of the direction vector are u x - u 2 , u, - v 2 , - vv 2 . When 

reduced to smallest integer numbers, they are placed within brackets and are 
known as the Miller indices for the direction, i.e., [hkl]. I n this notation 
the direction cosi nes f or the given directions are h / \/ h 2 + k 2 4- l 2 , 
k / Vh 2 + k 2 + l 2 , l/Vh 2 + k 2 + l 2 . Thus, the angle a between any two 
directions [h x , k x , / J and [h 2 , k 2 , 1 2 ] is given by the vector dot product 


h x h 2 + k x k 2 + /,/ 2 
^h\ + k\ + l 2 \/h 2 + + / 2 


Two other useful relationships in the crystallography of cubic systems are 
given without proof. 


1. The Miller indices of the direction normal to the (hkl) plan e are [hkl], 

2. The spacing between individual (hkl) planes is a = a 0 / Vh 2 + k 2 + l 2 
where a 0 is the lattice parameter. 


As an illustrative example, we shall calculate the angle between any two 
neighboring tetrahedral bonds in the diamond cubic lattice. The bonds lie along 
[111]-type directions that are specifically taken here to be [TTl] and [111]. 
Therefore, by Eq. 1-1, 


(!)(-!) + 1 (- 1 ) + (!)(!) 

; 1 2 + l 2 + l 2 \f(- l) 2 + (- l) 2 + 1 


— =-and 

2 3 


a = 109.5°. 


These two bond directions lie in a common (llO)-type crystal plane. The 
precise indices of this plane must be (T10) or (llO). This can be seen by noting 
that the dot product between each bond vector and the vector normal to the 
plane in which they lie must vanish. 

We close this brief discussion with some experimental evidence in support of 
the internal crystalline structure of solids. X-ray diffraction methods have very 
convincingly supplied this evidence by exploiting the fact that the spacing 
between atoms is comparable to the wavelength (X) of X-rays. This results in 
easily detected emitted beams of high intensity along certain directions when 
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incident X-rays impinge at critical diffraction angles (0). Under these condi¬ 
tions the well-known Bragg relation 

n\ = 2 a sin 0 (1-2) 

holds, where n is an integer. 

In bulk solids large diffraction effects occur at many values of 0. In thin 
films, however, very few atoms are present to scatter X-rays into the diffracted 
beam when 0 is large. For this reason the intensities of the diffraction lines or 
spots will be unacceptably small unless the incident beam strikes the film 
surface at a near-glancing angle. This, in effect, makes the film look thicker. 
Such X-ray techniques for examination of thin films have been developed and 
will be discussed in Chapter 6. A drawback of thin films relative to bulk solids 
is the long counting times required to generate enough signal for suitable 
diffraction patterns. This thickness limitation in thin films is turned into great 
advantage, however, in the transmission electron microscope. Here electrons 
must penetrate through the material under observation, and this can occur only 
in thin films or specially thinned specimens. The short wavelength of the 
electrons employed enables high-resolution imaging of the lattice structure as 
well as diffraction effects to be observed. As an example, consider the 
remarkable electron micrograph of Fig. 1-4, showing atom positions in a thin 


CoSi 2 


interface - 

— 

Si 


1924 



Figure 1-4. High resolution lattice image of epitaxial CoSi 2 film on (111) Si (< 112) 
projection). (Courtesy J. M. Gibson, AT & T Bell Laboratories). 
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film of cobalt silicide grown with perfect crystalline registry (epitaxially) on a 
silicon wafer. The silicide film-substrate was mechanically and chemically 
thinned normal to the original film plane to make the cross section visible. 
Such evidence should leave no doubt as to the internal crystalline nature of 
solids. 

1.2.2. Amorphous Solids 

In some materials the predictable long-range geometric order characteristic of 
crystalline solids breaks down. Such materials are the noncrystalline amor¬ 
phous or glassy solids exemplified by silica glass, inorganic oxide mixtures, 
and polymers. When such bulk materials are cooled from the melt even at low 
rates, the more random atomic positions that we associate with a liquid are 
frozen in place within the solid. On the other hand, while most metals cannot 
be amorphized, certain alloys composed of transition metal and metalloid 
combinations (e.g., Fe-B) can be made in glassy form but only through 
extremely rapid quenching of melts. The required cooling rates are about 10 6 
°C/sec, and therefore heat transfer considerations limit bulk glassy metals to 
foil, ribbon, or powder shapes typically — 0.05 mm in thickness or dimen¬ 
sion. In general, amorphous solids can retain their structureless character 
practically indefinitely at low temperatures even though thermodynamics sug¬ 
gests greater stability for crystalline forms. Crystallization will, however, 
proceed with release of energy when these materials are heated to appropriate 
elevated temperatures. The atoms then have the required mobility to seek out 
equilibrium lattice sites. 

Thin films of amorphous metal alloys, semiconductors, oxides, and chalco- 
genide glasses have been readily prepared by common physical vapor deposi¬ 
tion (evaporation and sputtering) as well as chemical vapor deposition (CVD) 
methods. Vapor quenching onto cryogenically cooled glassy substrates has 
made it possible to make alloys and even pure metals—the most difficult of all 
materials to amorphize—glassy. In such cases, the surface mobility of deposit¬ 
ing atoms is severely restricted, and a disordered atomic configuration has a 
greater probability of being frozen in. 

Our present notions of the structure of amorphous inorganic solids are 
extensions of models first established for silica glass. These depict amorphous 
Si0 2 to be a random three-dimensional network consisting of tetrahedra joined 
at the comers but sharing no edges or faces. Each tetrahedron contains a 
central Si atom bonded to four vertex oxygen atoms, i.e., (Si0 4 )' 4 . The 
oxygens are, in turn, shared by two Si atoms and are thus positioned as the 
pivotal links between neighboring tetrahedra. In crystalline quartz the tetrahe- 
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a. 





Figure 1-5. Schematic representation of (a) crystalline quartz; (b) random network 
(amorphous); (c) mixture of crystalline and amorphous regions. (Reprinted with permis¬ 
sion from John Wiley and Sons, E. H. Nicollian and J. R. Brews, MOS Physics and 
Technology , Copyright © 1983, John Wiley and Sons). 


dra cluster in an ordered six-sided ring pattern, shown schematically in Fig. 
l-5a, should be contrasted with the completely random network depicted in 
Fig. l-5b. In actuality, the glassy solid structure is most probably a compro¬ 
mise between the two extremes consisting of a considerable amount of short- 
range order and microscopic regions (i.e., less than 100 A in size) of 
crystallinity (Fig. l-5c). The loose disordered network structure allows for a 
considerable amount of “holes” or “vacancies” to exist, and it, therefore, 
comes as no surprise that the density of glasses will be less than that of their 
crystalline counterparts. In quartz, for example, the density is 2.65 g/cm 3 , 
whereas in silica glass it is 2.2 g/cm 3 . Amorphous silicon, which has found 
commercial use in thin-film solar cells, is, like silica, tetrahedrally bonded and 
believed to possess a similar structure. We return later to discuss further 
structural aspects and properties of amorphous films in various contexts 
throughout the book. 
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1.3. Defects in Solids 


The picture of a perfect crystal structure repeating a particular geometric 
pattern of atoms without interruption or mistake is somewhat exaggerated. 
Although there are materials—carefully grown silicon single crystals, for 
example—that have virtually perfect crystallographic structures extending over 
macroscopic dimensions, this is not generally true in bulk materials. In thin 
crystalline films the presence of defects not only serves to disrupt the geomet¬ 
ric regularity of the lattice on a microscopic level, it also significantly 
influences many film properties, such as chemical reactivity, electrical conduc¬ 
tion, and mechanical behavior. The structural defects briefly considered in this 
section are grain boundaries, dislocations, and vacancies. 


1.3.1. Grain Boundaries 

Grain boundaries are surface or area defects that constitute the interface 
between two single-crystal grains of different crystallographic orientation. The 
atomic bonding, in particular grains, terminates at the grain boundary where 
more loosely bound atoms prevail. Like atoms on surfaces, they are necessar¬ 
ily more energetic than those within the grain interior. This causes the grain 
boundary to be a heterogeneous region where various atomic reactions and 
processes, such as solid-state diffusion and phase transformation, precipitation, 
corrosion, impurity segregation, and mechanical relaxation, are favored or 
accelerated. In addition, electronic transport in metals is impeded through 
increased scattering at grain boundaries, which also serve as charge recombina¬ 
tion centers in semiconductors. Grain sizes in films are typically from 0.01 to 
1.0 fx m in dimension and are smaller, by a factor of more than 100, than 
common grain sizes in bulk materials. For this reason, thin films tend to be 
more reactive than their bulk counterparts. The fraction of atoms associated 
with grain boundaries is approximately 3 a/l, where a is the atomic dimension 
and / is the grain size. For / = 1000 A, this corresponds to about 5 in 1000. 

Grain morphology and orientation in addition to size control are not only 
important objectives in bulk materials but are quite important in thin-film 
technology. Indeed a major goal in microelectronic applications is to eliminate 
grain boundaries altogether through epitaxial growth of single-crystal semicon¬ 
ductor films onto oriented single-crystal substrates. Many special techniques 
involving physical and chemical vapor deposition methods are employed in this 
effort, which continues to be a major focus of activity in semiconductor 
technology. 




1.3. Defects in Solids 


11 


1.3.2. Dislocations 

Dislocations are line defects that bear a definite crystallographic relationship to 
the lattice. The two fundamental types of dislocations—the edge and the screw 
—are shown in Fig. 1-6 and are represented by the symbol X . The edge 
dislocation results from wedging in an extra row of atoms; the screw disloca¬ 
tion requires cutting followed by shearing of the perfect crystal lattice. The 
geometry of a crystal containing a dislocation is such that when a simple closed 
traverse is attempted about the crystal axis in the surrounding lattice, there is a 
closure failure; i.e., one finally arrives at a lattice site displaced from the 
starting position by a lattice vector, the so-called Burgers vector b. The 
individual cubic cells representing the original undeformed crystal lattice are 
now distorted somewhat in the presence of dislocations. Therefore, even 
without application of external forces on the crystal, a state of internal stress 
exists around each dislocation. Furthermore, the stresses differ around edge 
and screw dislocations because the lattice distortions differ. Close to the 
dislocation axis the stresses are high, but they fall off with distance (r) 
according to a 1 /r dependence. 

Dislocations are important because they have provided a model to help 
explain a great variety of mechanical phenomena and properties in all classes 
of crystalline solids. An early application was the important process of plastic 
deformation, which occurs after a material is loaded beyond its limit of elastic 
response. In the plastic range, specific planes shear in specific directions 
relative to each other much as a deck of cards shear from a rectangular prism 






EDGE DISLOCATION SCREW DISLOCATION 

Figure 1-6. (left) Edge dislocation; (right) screw dislocation. (Reprinted with per¬ 
mission from John Wiley and Sons, H. W. Hayden, W. G. Moffatt, and J. Wulff, The 
Structure and Properties of Materials , Vol. in. Copyright © 1965, John Wiley and 
Sons). 
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Figure 1-7. (a) Edge dislocation motion through lattice under applied shear stress. 

(Reprinted with permission from J. R. Shackelford, Introduction to Materials Science 
for Engineers, Macmillan, 1985). (b) Dislocation model of a grain boundary. The 
crystallographic misorientation angle 0 between grains is b/d h . 


to a parallelepiped. Rather than have rows of atoms undergo a rigid group 
displacement to produce the slip offset step at the surface, the same amount of 
plastic deformation can be achieved with less energy expenditure. This alterna¬ 
tive mechanism requires that dislocations undulate through the crystal, making 
and breaking bonds on the slip plane until a slip step is produced, as shown in 
Fig. l-7a. Dislocations thus help explain why metals are weak and can be 
deformed at low stress levels. Paradoxically, dislocations can also explain why 
metals work-harden or get stronger when they are deformed. These explana¬ 
tions require the presence of dislocations in great profusion. In fact, a density 
of as many 10 12 dislocation lines threading 1 cm 2 of surface area has been 
observed in highly deformed metals. Many deposited polycrystalline metal thin 
films also have high dislocation densities. Some dislocations are stacked 
vertically, giving rise to so-called small-angle grain boundaries (Fig. 1 -7b). 
The superposition of externally applied forces and internal stress fields of 
individual or groups of dislocations, arrayed in a complex three-dimensional 
network, sometimes makes it more difficult for them to move and for the 
lattice to deform easily. 

The role dislocations play in thin films is varied. As an example, consider 
the deposition of atoms onto a single-crystal substrate in order to grow an 
epitaxial single-crystal film. If the lattice parameter in the film and substrate 
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differ, then some geometric accommodation in bonding may be required at the 
interface, resulting in the formation of interfacial dislocations. The latter are 
unwelcome defects particularly if films of high crystalline perfection are 
required. For this reason, a good match of lattice parameters is sought for 
epitaxial growth. Substrate steps and dislocations should also be eliminated 
where possible prior to growth. If the substrate has screw dislocations emerg¬ 
ing normal to the surface, depositing atoms may perpetuate the extension of the 
dislocation spiral into the growing film. Like grain boundaries in semiconduc¬ 
tors, dislocations can be sites of charge recombination or generation as a result 
of uncompensated “dangling bonds.” Film stress, thermally induced mechani¬ 
cal relaxation processes, and diffusion in films are all influenced by disloca¬ 
tions. 

1.3.3. Vacancies 

The last type of defect considered is the vacancy. Vacancies are point defects 
that simply arise when lattice sites are unoccupied by atoms. Vacancies form 
because the energy e f required to remove atoms from interior sites and place 
them on the surface is not particularly high. This low energy, coupled with the 
increase in the statistical entropy of mixing vacancies among lattice sites, gives 
rise to a thermodynamic probability that an appreciable number of vacancies 
will exist, at least at elevated temperature. The fraction / of total sites that will 
be unoccupied as a function of temperature T is predicted to be approximately 

f=e ~ £ ' /kT , (1-3) 

reflecting the statistical thermodynamic nature of vacancy formation. Noting 
that k is the gas constant and e f is typically 1 eV/atom gives / = 10” 5 at 
1000 K. 

Vacancies are to be contrasted with dislocations, which are not thermody¬ 
namic defects. Because dislocation lines are oriented along specific crystallo¬ 
graphic directions, their statistical entropy is low. Coupled with a high 
formation energy due to the many atoms involved, thermodynamics would 
predict a dislocation content of less than one per crystal. Thus, although it is 
possible to create a solid devoid of dislocations, it is impossible to eliminate 
vacancies. 

Vacancies play an important role in all processes related to solid-state 
diffusion, including recrystallization, grain growth, sintering, and phase trans¬ 
formations. In semiconductors, vacancies are electrically neutral as well as 
charged and can be associated with dopant atoms. This leads to a variety of 
normal and anomalous diffusional doping effects. 
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1.4. Bonding of Materials 


Widely spaced isolated atoms condense to form solids due to the energy 
reduction accompanying bond formation. Thus, if N atoms of type A in the 
gas phase (g) combine to form a solid (s), the binding energy E b is released 
according to the equation 

NA g ^NA s + E b . (M) 

Energy E b must be supplied to reverse the equation and decompose the solid. 
The more stable the solid, the higher is its binding energy. It has become the 
custom to picture the process of bonding by considering the energetics within 
and between atoms as the interatomic distance progressively shrinks. In each 
isolated atom, the electron energy levels are discrete, as shown on the 
right-hand side of Fig. l-8a. As the atoms approach one another, the individual 
levels split, as a consequence of an extension of the Pauli exclusion principle, 
to a collective solid; namely, no two electrons can exist in the same quantum 
state. Level splitting and broadening occur first for the valence or outer 
electrons, since their electron clouds are the first to overlap. During atomic 
attraction, electrons populate these lower energy levels, reducing the overall 
energy of the solid. With further dimensional shrinkage, the overlap increases 
and the inner charge clouds begin to interact. Ion-core overlap now results in 
strong repulsive forces between atoms, raising the system energy. A compro¬ 
mise is reached at the equilibrium interatomic distance in the solid where the 
system energy is minimized. At equilibrium, some of the levels have broad¬ 
ened into bands of energy levels. The bands span different ranges of energy, 
depending on the atoms and specific electron levels involved. Sometimes as in 
metals, bands of high energy overlap. Insulators and semiconductors have 
energy gaps of varying width between bands where electron states are not 
allowed. The whys and hows of energy-level splitting, band structure evolu¬ 
tion, and implications with regard to property behavior are perhaps the most 
fundamental and difficult questions in solid-state physics. We briefly return to 
the subject of electron-band structure after introducing the classes of solids. 

An extension of the ideas expressed in Fig. l-8a is commonly made by 
simplifying the behavior to atoms as a whole, in which case the potential 
energy of interaction V(r) is plotted as a function of interatomic distance r in 
Fig. l-8b. The generalized behavior shown is common for all classes of solid 
materials, regardless of the type of bonding or crystal structure. Although the 
mathematical forms of the attractive or repulsive portions are complex, a 
number of qualitative features of these curves are not difficult to understand. 
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Figure 1-8. Splitting of electron levels (a) and energy of interaction between atoms 
(b) as a function of interatomic spacing. V(r) vs. r shown schematically for bulk and 
surface atoms. 




16 


A Review of Materials Science 


For example, the energy at the equilibrium spacing r = a 0 is the binding 
energy. Solids with high melting points tend to have high values of E b . The 
curvature of the potential energy is a measure of the elastic stiffness of the 
solid. To see this, we note that around a 0 the potential energy is approximately 
harmonic or parabolic. Therefore, V(r) = (1/2) K s r 2 y where K s is related to 
the spring constant (or elastic modulus). Narrow wells of high curvature are 
associated with large values of K s , broad wells of low curvature with small 
values of K s . Since the force F between atoms is given by F= ~dV / dr , 
F = -K s r, which has its counterpart in Hooke’s law—i.e., that stress is 
linearly proportional to strain. Thus, in solids with high K s values, corre¬ 
spondingly larger stresses develop under loading. Interestingly, a purely 
parabolic behavior for V implies a material with a coefficient of thermal 
expansion equal to zero. In real materials, therefore, some asymmetry or 
anharmonicity in V{r) exists. 

For the most part, atomic behavior within a thin solid film can also be 
described by a V{r)-r curve similar to that for the bulk solid. The surface 
atoms are less tightly bound, however, which is reflected by the dotted line 
behavior in Fig. l-8b. The difference between the energy minima for surface 
and bulk atoms is a measure of the surface energy of the solid. From the 
previous discussion, surface layers would tend to be less stiff and melt at lower 
temperatures than the bulk. Slight changes in equilibrium atomic spacing or 
lattice parameter at surfaces may also be expected. 

Despite apparent similarities, there are many distinctions between the four 
important types of solid-state bonding and the properties they induce. A 
discussion of these individual bonding categories follows. 

1.4.1. Metallic 

The so-called metallic bond occurs in metals and alloys. In metals the outer 
valence electrons of each atom form part of a collective free-electron cloud or 
gas that permeates the entire lattice. Even though individual electron-electron 
interactions are repulsive, there is sufficient electrostatic attraction between the 
free-electron gas and the positive ion cores to cause bonding. 

What distinguishes metals from all other solids is the ability of the electrons 
to respond readily to applied electric fields, thermal gradients, and incident 
light. This gives rise to high electrical and thermal conductivities as well as 
high reflectivities. Interestingly, comparable properties are observed in liquid 
metals, indicating that aspects of metallic bonding and the free-electron model 
are largely preserved even in the absence of a crystal structure. Metallic 
electrical resistivities typically ranging from 10“ 5 to 10“ 6 ohm-cm should be 
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contrasted with the much, much larger values possessed by other classes of 
solids. 

Furthermore, the temperature coefficient of resistivity is positive. Metals 
thus become poorer electrical conductors as the temperature is raised. The 
reverse is true for all other classes of solids. The conductivity of pure metals is 
always reduced with low levels of impurity alloying, which is also contrary to 
the usual behavior in other solids. The effect of both temperature and alloying 
element additions on metallic conductivity is to increase electron scattering, 
which in effect reduces the net component of electron motion in the direction 
of the applied electric field. On the other hand, in ionic and semiconductor 
solids production of more charge carriers is the result of higher temperatures 
and solute additions. 

The bonding electrons are not localized between atoms; thus, metals are said 
to have nondirectional bonds. This causes atoms to slide by each other and 
plastically deform more readily than is the case, for example, in covalent 
solids, which have directed atomic bonds. 

Examples of thin-metal-film applications include A1 contacts and intercon¬ 
nections in integrated circuits, and ferromagnetic alloys for data storage 
applications. Metal films are also used in mirrors, in optical systems, and as 
decorative coatings of various components and packaging materials. 

1.4.2. Ionic 

Ionic bonding occurs in compounds composed of strongly electropositive 
elements (metals) and strongly electronegative elements (nonmetals). The 
alkali halides (NaCl, LiF, etc.) are the most unambiguous examples of 
ionically bonded solids. In other compounds, such as oxides, sulfides, and 
many of the more complex salts of inorganic chemistry (e.g., nitrates, sulfates, 
etc.), the predominant, but not necessarily exclusive, mode of bonding is ionic 
in character. In the rock-salt structure of NaCl, for example, there is an 
alternating three-dimensional checkerboard array of positively charged cations 
and negatively charged anions. Charge transfer from the 3s electron level of 
Na to the 3p level of Cl creates a single isolated NaCl molecule. In the solid, 
however, the transferred charge is distributed uniformly among nearest neigh¬ 
bors. Thus, there is no preferred directional character in the ionic bond since 
the electrostatic forces between spherically symmetric inert gaslike ions is 
independent of orientation. 

Much success has been attained in determining the bond energies in alkali 
halides without resorting to quantum mechanical calculation. The alternating 
positive and negative ionic charge array suggests that Coulombic pair interac- 
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tions are the cause of the attractive part of the interatomic potential, which 
varies simply as — 1 /r. Ionic solids are characterized by strong electrostatic 
bonding forces and, thus, relatively high binding energies and melting points. 
They are poor conductors of electricity because the energy required to transfer 
electrons from anions to cations is prohibitively large. At high temperatures, 
however, the charged ions themselves can migrate in an electric field, resulting 
in limited electrical conduction. Typical resistivities for such materials can 
range from 10 6 to 10 15 ohm-cm. 

Among the ionic compounds employed in thin-film technology are MgF 2 , 
ZnS, and CeF 3 , which are used in antireflection coatings on optical compo¬ 
nents. Assorted thin-film oxides and oxide mixtures such as Y 3 Fe 5 0 12 , 
Y 3 A1 5 0 12 , and LiNb0 3 are employed in components for integrated optics. 
Transparent electrical conductors such as In 2 0 3 -Sn0 2 glasses, which serve as 
heating elements in window defrosters on cars as well as electrical contacts 
over the light exposed surfaces of solar cells, have partial ionic character. 

1.4.3. Covalent 

Covalent bonding occurs in elemental as well as compound solids. The 
outstanding examples are the elemental semiconductors Si, Ge, and diamond, 
and the III-V compound semiconductors such as GaAs and InP. Whereas 
elements at the extreme ends of the periodic table are involved in ionic 
bonding, covalent bonds are frequently formed between elements in neighbor¬ 
ing columns. The strong directional bonds characteristic of the group IV 
elements are due to the hybridization or mixing of the s and p electron wave 
functions into a set of orbitals which have high electron densities emanating 
from the atom in a tetrahedral fashion. A pair of electrons contributed by 
neighboring atoms makes a covalent bond, and four such shared electron pairs 
complete the bonding requirements. 

Covalent solids are strongly bonded hard materials with relatively high 
melting points. Despite the great structural stability of semiconductors, rela¬ 
tively modest thermal stimulation is sufficient to release electrons from filled 
valence bonding states into unfilled electron states. We speak of electrons 
being promoted from the valence band to the conduction band, a process that 
increases the conductivity of the solid. Small dopant additions of group III 
elements like B and In as well as group V elements like P and As take up 
regular or substitutional lattice positions within Si and Ge. The bonding 
requirements are then not quite met for group III elements, which are one 
electron short of a complete octet. An electron deficiency or hole is thus 
created in the valence band. 
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For each group V dopant an excess of one electron beyond the bonding octet 
can be promoted into the conduction band. As the name implies, semiconduc¬ 
tors lie between metals and insulators insofar as their ability to conduct 
electricity is concerned. Typical semiconductor resistivities range from 10“ 3 
to 10 5 ohm-cm. Both temperature and level of doping are very influential in 
altering the conductivity of semiconductors. Ionic solids are similar in this 
regard. 

The controllable spatial doping of semiconductors over very small lateral 
and transverse dimensions is a critical requirement in processing integrated 
circuits. Thin-film technology is thus simultaneously practiced in three dimen¬ 
sions in these materials. Similarly, there is a great necessity to deposit 
compound semiconductor thin films in a variety of optical device applications. 
Other largely covalent materials such as SiC, TiC, and BN have found coating 
applications where hard, wear-resistant surfaces are required. They are usually 
deposited by chemical vapor deposition methods and will be discussed at length 
in Chapter 12. 

1.4.4. van der Waals Forces 

A large group of solid materials are held together by weak molecular forces. 
This so-called van der Waals bonding is due to dipole-dipole charge interac¬ 
tions between molecules that, though electrically neutral, have regions possess¬ 
ing a net positive or negative charge distribution. Organic molecules such as 
methane and inert gas atoms are weakly bound together in the solid by these 
charges. Such solids have low melting points and are mechanically weak. Thin 
polymer films used as photoresists or for sealing and encapsulation purposes 
contain molecules that are typically bonded by van der Waals’ forces. 

1.4.5. Energy-Band Diagrams 

A common graphic means of distinguishing between different classes of solids 
involves the use of energy-band diagrams. Reference to Fig. l-8a shows how 
individual energy levels broaden into bands when atoms are brought together 
to form solids. What is of interest here are the energies of electrons at the 
equilibrium atomic spacing in the crystal. For metals, insulators, and semicon¬ 
ductors the energy-band structures at the equilibrium spacing are schematically 
indicated in Fig. l-9a, b, c. In each case the horizontal axis can be loosely 
interpreted as some macroscopic distance within the solid with much larger 
than atomic dimensions. This distance spans a region within the homogeneous 
bulk interior where the band energies are uniform from point to point. The 



20 


A Review of Materials Science 



V BAND | | V BAND | ffiV'-'T. Wv ; : : ¥ : 'j 

N-TYPE P-TYPE ’’ N..P 

JUNCTION 


Figure 1-9. Schematic band structure for (a) metal; (b) insulator, (c) semiconductor; 
(d) N-type semiconductor; (e) P-type semiconductor; (0 P-N semiconductor junction. 

uppermost band shown is called the conduction band because once electrons 
access its levels, they are essentially free to conduct electricity. 

Metals have high conductivity because the conduction band contains elec¬ 
trons from the outset. One has to imagine that there are a mind-boggling 10 22 
electrons per cubic centimeter (~ one per atom) in the conduction band, all of 
which occupy different quantum states. Furthermore, there are enormous 
numbers of states all at the same energy level, a phenomenon known as 
degeneracy. Lastly, the energy levels are extremely closely spaced and com¬ 
pressed within a typical 5-eV conduction-band energy width. The available 
electrons occupy states within the band up to a certain level known as the 
Fermi energy E f . Above E f are densely spaced excited levels, but they are all 
vacant. If electrons are excited sufficiently (e.g., by photons or through 
heating), they can gain enough energy to populate these states or even leave the 
metal altogether (i.e., by photo- and thermionic emission) and enter the 
vacuum. As indicated in Fig. l-9a, the energy difference between the vacuum 
level and E f is equal to q<f > M , where <j> M is the work function in volts and q 
is the electronic charge. Even under very tiny electric fields, the electrons in 
states at E f can easily move into the unoccupied levels above it, resulting in a 
net current flow. For this reason, metals have high conductivities. 

At the other extreme are insulators, in which the conduction band normally 
has no electrons. The valence electrons used in bonding completely fill the 
valence band. A large energy gap E g ranging from 5 to 10 eV separates the 
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filled valence band from the empty conduction band. There are normally no 
states and therefore no electrons within the energy gap. In order to conduct 
electricity, electrons must acquire sufficient energy to span the energy gap, but 
for all practical cases this energy barrier is all but insurmountable. 

Pure (intrinsic) semiconductors at very low temperatures have a band 
structure like that of insulators, but E g is smaller; e.g., E g = 1.1 eV in Si and 
0.68 eV in Ge. When a semiconductor is doped, new states are created within 
the energy gap. The electron (or hole) states associated with donors (or 
acceptors) are usually only a small fraction of an electron volt from the bottom 
of the conduction band (or top of the valence band). It now takes very little 
stimulation to excite electrons or holes to conduct electricity. The actual 
location of E f with respect to the band diagram depends on the type and 
amount of doping atoms present. In an intrinsic semiconductor, E f lies in the 
middle of the energy gap, because E f is strictly defined as that energy level 
for which the probability of occupation is 1/2. If the semiconductor is doped 
with donor atoms to make it TV-type, E f lies above the midgap energy, as 
shown in Fig. l-9d. If acceptor atoms are the predominant dopants, E f lies 
below the midgap energy and a P-type semiconductor results (Fig. l-9e). 

Band diagrams have important implications in thin-film systems where 
composite layers of different materials are involved. A simple example is the 
P-Adjunction, which is shown in Fig. l-9f without any applied electric fields. 
A condition ensuring thermodynamic equilibrium for the electrons is that E f 
must be constant throughout the system. This is accomplished through electron 
transfer from the N side with high E f (low <p N ) to the P side with low E f 
(high <j> p ). An internal built-in electric field is established due to this charge 
transfer resulting in both valence- and conduction-band bending in the junction 
region. In the bulk of each semiconductor, the bands are unaffected as 
previously noted. Similar band bending occurs in thin-film metal-semiconduc¬ 
tor contacts, semiconductor superlattices, and in metal-oxide semiconductor 
(MOS) structures over dimensions comparable to the film thicknesses in¬ 
volved. Reference to some of these thin-film structures will be made in later 
chapters. 


1.5. Thermodynamics of Materials 


Thermodynamics is definite about events that are impossible. It will say, for 
example, that reactions or processes are thermodynamically impossible. Thus, 
gold films do not oxidize and atoms do not normally diffuse up a concentration 
gradient. On the other hand, thermodynamics is noncommittal about permissi- 
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ble reactions and processes. Thus, even though reactions are thermodynami¬ 
cally favored, they may not occur in practice. Films of silica glass should 
revert to crystalline form at room temperature according to thermodynamics, 
but the solid-state kinetics are so sluggish that for all practical purposes 
amorphous Si0 2 is stable. A convenient measure of the extent of reaction 
feasibility is the free-energy function G defined as 

G = H - TS, (1-5) 

where H is the enthalpy, S the entropy, and T the absolute temperature. 
Thus, if a system changes from some initial (i) to final (0 state at constant 
temperature due to a chemical reaction or physical process, a free-energy 
change AG = G f - Goccurs given by 

AG = AH - TAS, (1-6) 

where AH and AS are the corresponding enthalpy and entropy changes. A 
consequence of the second law of thermodynamics is that spontaneous reac¬ 
tions occur at constant temperature and pressure when AG < 0. This condition 
implies that systems will naturally tend to minimize their free energy and 
successively proceed from a value G { to a still lower, more negative value G, 
until it is no longer possible to reduce G further. When this happens, AG = 0. 
The system has achieved equilibrium, and there is no longer a driving force for 
change. 

On the other hand, for a process that cannot occur, AG > 0. Note that 
neither the sign of AH nor of AS taken individually determines reaction 
direction; rather it is the sign of the combined function AG that is crucial. 
Thus, during condensation of a vapor to form a solid film, AS < 0 because 
fewer atomic configurations exist in the solid. The decrease in enthalpy, 
however, more than offsets that in entropy, and the net change in AG is 
negative. 

The concept of minimization of free energy as a criterion for both stability in 
a system and forward change in a reaction or process is a central theme in 
materials science. The following discussion will develop concepts of thermody¬ 
namics used in the analysis of chemical reactions and phase diagrams. Subse¬ 
quent applications will be made to such topics as chemical vapor deposition, 
interdiffusion, and reactions in thin Films. 

1.5.1. Chemical Reactions 

The general chemical reaction involving substances A, B, and C in equilibrium 
is 


a A + bB cC. 


( 1 - 7 ) 
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The free-energy change of the reaction is given by 

AG — cG c — gG a — bG B , (1-8) 

where a , b y and c are the stoichiometric coefficients. It is customary to denote 
the free energy of individual reactant or product atomic or molecular species 
by 

G, = G“ + RT In a x . (1-9) 

G,° is the free energy of the species in its reference or standard state. For 
solids this is usually the stable pure material at 1 atm at a given temperature. 
The activity a x may be viewed as an effective thermodynamic concentration 
and reflects the change in free energy of the species when it is not in its 
standard state. Substitution of Eq. 1-9 into Eq. 1-8 yields 

Qr 

AG = AG° + RT\n— ~r y (1-10) 

<4 


where AG° = cG£ - aG £ - bG B . If the system is now in equilibrium, 
AG = 0 and a x is the equilibrium value a i(eq) . Thus, 

0 = AG° + RT In! gc(eq * 1 (l-ll) 

1 ) 

or 

- AG° = RT In Ky (1-12) 

where the equilibrium constant K is defined by the quantity in braces. 
Equation 1-12 is one of the most frequently used equations in chemical 
thermodynamics and will be helpful in analyzing CVD reactions. 

Combining Eqs. 1-10 and 1-11 gives 


AG = RT\ n 


4 C(tq) 


^A(eq) 


Y B(cq) 


(1-13) 


Each term a x represents a supersaturation of the species if it exceeds 1, 

and a subsaturation if it is less than 1. Thus, if there is a supersaturation of 
reactants and a subsaturation of products, AG < 0. The reaction proceeds 
spontaneously as written with a driving force proportional to the magnitude of 
AG. For many practical cases the a x differ little from the standard-state 
activities, which are taken to be unity. Therefore, in such a case Eq. 1-10 
yields 


AG = AG°. 


(1-14) 
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Quantitative information on the feasibility of chemical reactions is thus 
provided by values of AG°, and these are tabulated in standard references on 
thermodynamic data. The reader should be aware that although much of the 
data are the result of measurement, some values are inferred from various 
connecting thermodynamic laws and relationships. Thus, even though the 
vapor pressure of tungsten at room temperature cannot be directly measured, 
its value is nevertheless “known.” In addition, the data deal with equilibrium 
conditions only, and many reactions are subject to overriding kinetic limita¬ 
tions despite otherwise favorable free-energy considerations. 

A particularly useful representation of AG° data for formation of metal 
oxides as a function of temperature is shown in Fig. 1-10 and is known as an 
Ellingham diagram. As an example of its use, consider two oxides of impor¬ 
tance in thin-film technology, Si0 2 and A1 2 0 3 , with corresponding oxidation 
reactions 

Si + 0 2 -► Si0 2 ; AG J i02 , (1 -15a) 

(4/3)AJ + 0 2 - (2/3)Al 2 0 3 ; AG’ 2<v (M5b) 


Through elimination of 0 2 the reaction 

(4/3)A1 + Si0 2 -► (2/3 )a 1 2 0 3 4- Si (1-16) 

results, where AG° = AG^q 3 - AG^. Since the AG°~T curve for A1 2 0 3 
is more negative or lower than that for Si0 2 , the reaction is thermodynami¬ 
cally favored as written. At 400 °C, for example, AG° for Eq. 1-16 is 
-233 — (— 180) = —53 kcal/mole. Therefore, A1 films tend to reduce Si0 2 
films, leaving free Si behind, a fact observed in early field effect transistor 
structures. This was one reason for the replacement of A1 film gate electrodes 
by poly crystal line Si films. As a generalization then, the metal of an oxide that 
has a more negative AG° than a second oxide will reduce the latter and be 
oxidized in the process. Further consideration of Eqs. 1-12 and 1-15b indicates 
that 


(gAl z Q 3 ) 2/3 


exp - 


A G° 
RT ‘ 


(1-17) 


The A1 2 0 3 and A1 may be considered to exist in pure standard states with unity 
activities while the activity of 0 2 is taken to be its partial pressure P 0 . 
Therefore, AG° = RT In P 02 . If A1 were evaporated from a crucible to 
produce a film, then the value of Pq 2 in equilibrium with both A1 and A1 2 0 3 
can be calculated at any temperature when AG° is known. If the actual oxygen 
partial pressure exceeds the equilibrium pressure, then A1 ought to oxidize. If 
the reverse is true, A1 2 0 3 would be reduced to Al. At 1000 °C, AG° = -202 



1.5. Thermodynamics of Materials 


25 



Figure 1-10* Standard free energy of oxide formation vs. temperature: O Melting 
point of metal; □ boiling point of metal (1 atm). (Reprinted with permission from 
A. G. Guy, Introduction to Materials Science , McGraw-Hill, Inc., 1972). 


kcal and P 02 = 2 x 10“ 35 atm. Since this value is many orders of magnitude 
below actual oxygen partial pressures in vacuum systems, A1 would be 
expected to oxidize. It does to some extent, and a thin film of alumina probably 
forms on the surface of the molten aluminum source. Nevertheless, oxide-free 
films can be deposited in practice. 

Similar Ellingham plots of free energy of formation versus temperature exist 
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for sulfides, carbides, nitrides, and chlorides. In Chapter 4 we consider such a 
diagram for Si-H-Cl compounds, which is useful for the thermodynamic 
analysis of Si CVD. 

1.5.2. Phase Diagrams 

The most widespread method for representing the conditions of chemical 
equilibrium for inorganic systems as a function of initial composition, tempera¬ 
ture, and pressure is through the use of phase diagrams. By phases we not only 
mean the solid, liquid, and gaseous states of pure elements and compounds but 
a material of variable yet homogeneous composition, such as an alloy, is also a 
phase. Although phase diagrams generally contain a wealth of thermodynamic 
information on systems in equilibrium, they can readily be interpreted without 
resorting to complex thermodynamic laws, functions, or equations. They have 
been experimentally determined for many systems by numerous investigators 
over the years and provide an invaluable guide when synthesizing materials. 

There are a few simple rules for analyzing phase diagrams. The most 
celebrated of these is the Gibbs phase rule, which, though deceptively simple, 
is arguably the most important linear algebraic equation in physical science. It 
can be written as 

f=n + 2 - tf, (1-18) 

where n is the number of components (i.e., different atomic species), ^ is the 
number of phases, and / is the number of degrees of freedom or variance in 
the system. The number of intensive variables that can be independently varied 
without changing the phase equilibrium is equal to /. 

7.5.2.7. One-Component System . As an application to a one-component 
system, consider the P-7 1 diagram given for carbon in Fig. 1-11. Shown are 
the regions of stability for different phases of carbon as a function of pressure 
and temperature. Within the broad areas, the single phases diamond and 
graphite are stable. Both P and T variables can be independently varied to a 
greater or lesser extent without leaving the single-phase field. This is due to the 
phase rule, which gives / = 1+2 - 1=2. Those states that lie on any of 
the lines of the diagram represent two-phase equilibria. Now / = 1 + 2 - 2 
= 1. This means, for example, that in order to change but maintain the 
equilibrium along the diamond-graphite line, only one variable, either T or 
P, can be independently varied; the corresponding variables P or T must 
change in a dependent fashion. At a point where three phases coexist (not 
shown), /= 0. Any change of T or P will destroy the three-phase equilib- 
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Figure 1-11. Portion of the pressure-temperature diagram for carbon showing stabil¬ 
ity regions of diamond and graphite. Shaded areas represent regions of diamond 
formation in the indicated metal solvents. (Reprinted with permission from R. C. 
DeVries, Ann. Rev. Mater. Sci. 17, 161, 1987). 

rium, leaving instead either one or two phases. The diagram suggests that 
pressures between 10 4 to 10 5 bars (~ 10,000-100,000 atm) are required to 
transform graphite into diamond. In addition, excessively high temperatures 
(~ 2000 K) are required to make the reaction proceed at appreciable rates. It 
is exciting, therefore, that diamond thin films have been deposited by decom¬ 
posing methane in a microwave plasma at low pressure and temperature, thus 
avoiding the almost prohibitive pressure-temperature regime required for bulk 
diamond synthesis. 



1.5.2.2. Two-Component Systems . When two elements or compounds are 
made to combine, many very important materials, less-well-known than the 
compounds of inorganic chemistry, can be produced. Binary metal alloys and 
compound semiconductors such as Ni-Cr (nichrome) and GaAs are examples 
that have important bulk as well as thin-film uses. The resultant phases that 
form as a function of initial reactant proportions and temperature are depicted 
on binary equilibrium phase diagrams. Collections of these have been pub¬ 
lished for metal, semiconductor, and ceramic systems and are among the most 
frequently consulted references in the field of materials. Unless noted other- 
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Figure 1-12. Ge-Si equilibrium phase diagram. (Reprinted with permission from 
M. Hansen, Constitution of Binary Alloys , McGraw-Hill, Inc. 1958). 

wise, these diagrams hold at atmospheric pressure, in which case the variance 
is reduced by 1. The Gibbs phase rule now states / =n+l-^or/=3- 
\j/. Thus, at most three phases can coexist in equilibrium. 

To learn how to interpret binary phase diagrams, let us first consider the 
Ge-Si system shown in Fig. 1-12. Such a system is interesting because of the 
possibility of creating semiconductors having properties intermediate to those 
of Ge and Si. On the horizontal axis, the overall composition is indicated. Pure 
Ge and Si components are represented at the extreme left and right axes, 
respectively, and compositions of initial mixtures of Ge and Si are located in 
between. Such compositions are given in either weight or atomic percent. The 
following set of rules will enable a complete equilibrium phase analysis for an 
initial alloy composition X Q heated to temperature 7^. 

1. Draw a vertical line at composition X Q . Any point on this line represents a 
state of this system at the temperature indicated on the left-hand scale. 

2. The chemical compositions of the resulting phases depend on whether the 
point lies (a) in a one-phase field, (b) in a two-phase field, or (c) on a 
sloping or horizontal (isothermal) boundary between phase fields. 

a. For states within a single-phase field., i.e., L (liquid), S (solid), or a 
compound, the phase composition or chemical analysis is always the 
same as the initial composition. 
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b. In a two-phase region, i.e., L + S, a + j3, etc., a horizontal tie line is 
first drawn through the state point extending from one end of the 
two-phase field to the other as shown in Fig. 1-12. On either side of the 
two-phase field are the indicated single-phase fields (L and S). The 
compositions of the two phases in question are given by projecting the 
ends of the tie line vertically down and reading off the values. For 
example, if X 0 = 40 at% Si and T 0 = 1200 °C, X L = 34 at% Si and 
X s = 67 at% Si. 

c. State points located on either a sloping or a horizontal boundary cannot 
be analyzed; phase analyses can only be made above or below the 
boundary lines according to rules a and b. Sloping boundaries are known 
as liquidus or solidus lines when L/L + S or L + S/S phase field 
combinations are respectively involved. Such lines also represent solu¬ 
bility limits and are, therefore, associated with the processes of solution 
or rejection of phases (precipitation) from solution. The horizontal 
isothermal boundaries indicate the existence of phase transformations 
involving three phases. The following common reactions occur at these 
critical isotherms, where a, 0 and y are solid phases: 

1. Eutectic: L a + 0 

2. Eutectoid: y -> a + 0 

3. Peritectic: L + a -► 7 

3. The relative amount of phases present depends on whether the state point 
lies in (a) a one-phase field or (b) a two-phase field. 

a. Here the one phase in question is homogeneous and present exclusively. 
Therefore, the relative amount of this phase is 100%. 

b. In the two-phase field the lever rule must be applied to obtain the 
relative phase amounts. From Fig. 1-12, state X 0 , T 0 , and the corre¬ 
sponding tie line, the relative amounts of L and S phases are given by 

-X 0 X 0 - X L 

%L = -F- 7 ^ X 100; %S = X 100, (1-19) 

X$ — X L X s a l 

where %L plus %S = 100. (Substitution gives %L = (67 - 40)/(67 - 34) x 
100 = 81.8, and %S = (40 - 34)/(67 - 34) X 100 = 18.2.) Equation 1-19 
represents a definition of the lever rule that essentially ensures conservation of 
mass in the system. The tie line and lever rule can be applied only in a 
two-phase region; they make no sense in a one-phase region. Such analyses do 
reveal information on phase compositions and amounts, yet they say nothing 
about the physical appearance or shape that phases actually take. Phase 
morphology is dependent on issues related to nucleation and growth. 
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Figure 1-13. Al-Si equilibrium phase diagram. (Reprinted with permission from 
M. Hansen, Constitution of Binary Alloys y McGraw-Hill. Inc. 1958). 

Before leaving the Ge-Si system, note that L represents a broad liquid 
solution field where Ge and Si atoms mix in all proportions. Similarly, at 
lower temperatures, Ge and Si atoms mix randomly but on the atomic sites of a 
diamond cubic lattice to form a substitutional solid solution. The lens-shaped 
L + S region separating the single-phase liquid and solid fields occurs in many 
binary systems, including Cu-Ni, Ag-Au, Pt-Rh, Ti-W, and Al 2 0 3 -Cr 2 0,. 

A very common feature on binary phase diagrams is the eutectic isotherm. 
The Al-Si system shown in Fig. 1-13 is an example of a system undergoing a 
eutectic transformation at 577 °C. Alloy films containing about 1 at% Si are 
used to make contacts to silicon in integrated circuits. The insert in Fig. 1-13 
indicates the solid-state reactions for this alloy involve either the formation of 
an Al-rich solid solution above 520 °C or the rejection of Si below this 
temperature in order to satisfy solubility requirements. Although this particu- 
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lar alloy cannot undergo a eutectic transformation, all alloys containing more 
than 1.59 at% Si can. When crossing the critical isotherm from high tempera¬ 
ture, the reaction 

L( 11.3 at% Si) 5 ^-> C Al(l .59 at% Si) + Si (1-20) 

occurs. Three phases coexist at the eutectic temperature, and therefore / = 0. 
Any change in temperature and/or phase composition will drive this very 
special three-phase equilibrium into single- (i.e., L) or two-phase fields (i.e., 
L + Al, L 4- Si, A1 -I- Si), depending on composition and temperature. 

The important GaAs system shown in Fig. 1-14 contains two independent 
side-by-side eutectic reactions at 29.5 and 810 °C. For the purpose of analysis 
one can consider that there are two separate eutectic subsystems, Ga-GaAs 
and GaAs-As. In this way complex diagrams can be decomposed into simpler 
units. The critical eutectic compositions occur so close to either pure compo¬ 
nent that they cannot be resolved on the scale of this figure. The prominent 
central vertical line represents the stoichiometric GaAs compound, which melts 
at 1238 °C. Phase diagrams for several other important 3-5 semiconductors, 

WEIGHT PERCENT ARSENIC 



Figure 1-14. Ga-As equilibrium phase diagram. (Reprinted with permission from 
M. Hansen, Constitution of Binary Alloys , McGraw-Hill, Inc. 1958). 
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(e.g., InP, GaP, and InAs) have very similar appearances. These compound 
semiconductors are common in other ways. For example, one of the compo¬ 
nents (e.g., Ga, In) has a low melting point coupled with a rather low vapor 
pressure, whereas the other component (e.g., As, P) has a higher melting point 
and a high vapor pressure. These properties complicate both bulk and thin-film 
single-crystal growth processes. 

We end this section on phase diagrams by reflecting on some distinctions in 
their applicability to bulk and thin-film materials. High-temperature phase 
diagrams were first determined in a systematic way for binary metal alloys. 
The traditional processing route for bulk metals generally involves melting at a 
high temperature followed by solidification and subsequent cooling to the 
ambient. It is a reasonable assumption that thermodynamic equilibrium is 
attained in these systems, especially at elevated temperatures. Atoms in metals 
have sufficient mobility to enable stable phases to nucleate and grow within 
reasonably short reaction times. This is not generally the case in metal oxide 
systems, however, because of the tendency of melts to form metastable glasses 
due to sluggish atomic motion. 

In contrast, thin films do not generally pass from a liquid phase through a 
vertical succession of phase fields. For the most part, thin-film science and 
technology is characterized by low-temperature processing where equilibrium 
is difficult to achieve. Depending on what is being deposited and the conditions 
of deposition, thin films possessing varying degrees of thermodynamic stability 
can be readily produced. For example, single-crystal silicon is the most stable 
form of this element below the melting point. Nevertheless, chemical vapor 
deposition of Si from chlorinated silanes at 1200 °C will yield single-crystal 
films, and amorphous films can be produced below 600 °C. In between, 
poly crystalline Si films of varying grain size can be deposited. Since films are 
laid down an atomic layer at a time, the thermal energy of individual atoms 
impinging on a massive cool substrate heat sink can be transferred to the latter 
at an extremely rapid rate. Deprived of energy, the atoms are relatively 
immobile. It is not surprising, therefore, that metastable and even amorphous 
semiconductor and alloy films can be evaporated or sputtered onto cool 
substrates. When such films are heated, they crystallize and revert to the more 
stable phases indicated by the phase diagram. 

Interesting issues related to binary phase diagrams arise with multicompo¬ 
nent thin films that are deposited in layered structures through sequential 
deposition from multiple sources. For example, “strained layer superlattices” 
of Ge-Si have been grown by molecular beam epitaxy (MBE) techniques (see 
Chapter 7). Films of Si and Si + Ge solid-solution alloy, typically tens of 
angstroms thick, have been sequentially deposited such that the resultant 
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composite film is a single crystal with strained lattice bonds. The resolution of 
distinct layers as revealed by the transmission electron micrograph of Fig. 
14-17 is suggestive of a two-phase mixture. On the other hand, a single crystal 
implies a single phase even if it possesses a modulated chemical composition. 
Either way, the superlattice is not in thermodynamic equilibrium because the 
Ge-Si phase diagram unambiguously predicts a stable solid solution at low 
temperature. Equilibrium can be accelerated by heating, which results in film 
homogenization by interatomic diffusion. In thin Films, phases such as solid 
solutions and compounds are frequently accessed horizontally across the 
phase diagram during an isothermal anneal. This should be contrasted with 
bulk materials, where equilibrium phase changes commonly proceed vertically 
downward from elevated temperatures. 


1 . 6 . Kinetics 


1.6.1. Macroscopic Transport 

Whenever a material system is not in thermodynamic equilibrium, driving 
forces arise naturally to push it toward equilibrium. Such a situation can occur, 
for example, when the free energy of a microscopic system varies from point 
to point because of compositional inhomogeneities. The resulting atomic 
concentration gradients generate time-dependent, mass-transport effects that 
reduce free-energy variations in the system. Manifestations of such processes 
include phase transformations, recrystallization, compound growth, and degra¬ 
dation phenomena in both bulk and thin-film systems. In solids, mass transport 
is accomplished by diffusion, which may be defined as the migration of an 
atomic or molecular species within a given matrix under the influence of a 
concentration gradient. Fick established the phenomenological connection 
between concentration gradients and the resultant diffusional transport through 
the equation 

dC 

J = - D— (1-21) 

dx 

The minus sign occurs because the vectors representing the concentration 
gradient dC/dx and atomic flux J are oppositely directed. Thus an increasing 
concentration in the positive x direction induces mass flow in the negative * 
direction, and vice versa. The units of C are typically atoms/cm 3 . The 
diffusion coefficient D, which has units of cm 2 /sec, is characteristic of both 
the diffusing species and the matrix in which transport occurs. The extent of 
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observable diffusion effects depends on the magnitude of D. As we shall later 
note, D increases in exponential fashion with temperature according to a 
Maxwell-Boltzmann relation; i.e., 

D = D 0 ex p - E d /RT , (1-22) 


where D 0 is a constant and RT has the usual meaning. The activation energy 
for diffusion is E D (cal/mole). 

Solid-state diffusion is generally a slow process, and concentration changes 
occur over long periods of time; the steady-state condition in which concentra¬ 
tions are time-independent rarely occurs in bulk solids. Therefore, during 
one-dimensional diffusion, the mass flux across plane x of area A exceeds 
that which flows across plane x + dx. Atoms will accumulate with time in the 
volume A dx , and this is expressed by 


I dJ \ dJ dc 

JA - 7 + — dx \ A = - A dx = —A dx. 

\ dx ) dx dt 


(1-23) 


Substituting Eq. 
gives 


1-21 and assuming that D is a constant independent of C or j 


dC(x,t) d 2 C(x, t) 
dt D $ x 2 


(1-24) 


The non-steady-state heat conduction equation is identical if temperature is 
substituted for C and the thermal diffusivity for D. Many solutions for both 
diffusion and heat conduction problems exist for media of varying geometries, 
constrained by assorted initial and boundary conditions. They can be found in 
the books by Carslaw and Jaeger, and by Crank, listed in the bibliography. 
Since complex solutions to Eq. 1-24 will be discussed on several occasions 
(e.g., in Chapters 8, 9, and 13), we introduce simpler applications here. 

Consider an initially pure thick film into which some solute diffuses from 
the surface. If the film dimensions are very large compared with the extent of 
diffusion, the situation can be physically modeled by the following conditions: 

C(*,0)=0 at / = 0 foroo>x>0, (l-25a) 

C( oo, t) = 0 at .v = oo for t > 0. (l-25b) 

The second boundary condition that must be specified has to do with the nature 
of the diffusant distribution maintained at the film surface x = 0. Two simple 
cases can be distinguished. In the first, a thick layer of diffusant provides an 
essentially limitless supply of atoms maintaining a constant surface concentra¬ 
tion C 0 for all time. In the second case, a very thin layer of diffusant provides 
an instantaneous source S 0 of surface atoms per unit area. Here the surface 
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concentration diminishes with time as atoms diffuse into the underlying film. 
These two cases are respectively described by 

C(O,0 = C o (l-26a) 

[ C(x,t)dx = S 0 (1 -26b) 

Expressions for C(x> t) satisfying these conditions are respectively 

C(,,,) = Cierfc^L- - C„(l - erf^). 0-27.) 
S x 2 

C(x,t)=: exp-, (l-27b) 

v ’ s/lTDi 4 Dt v ' 

and these represent the simplest mathematical solutions to the diffusion equa¬ 
tion. They have been employed to determine doping profiles and junction 



Figure 1-15. Normalized Gaussian and Erfc curves of C/C 0 vs. x/ y/4~Dt. Both 
logarithmic and linear scales are shown. (Reprinted with permission from John Wiley 
and Sons, from W. E. Beadle, J. C. C. Tsai, and R. D. Plummer, Quick Reference Manual 
for Silicon Integrated Circuit Technology , Copyright © 1985, Bell Telephone Laboratories 
Inc. Published by J. Wiley and Sons). 
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depths in semiconductors. The error function erf x/2 \[Di, defined by 


erf 


2-JWt 


2 

\/t 


L 


!/I s/Dl 


dZ, 


(1-28) 


is a tabulated function of only the upper limit or argument x/2 \fDi. 
Normalized concentration profiles for the Gaussian and Erfc solutions 
are shown in Fig. 1-15. It is of interest to calculate how these distributions 
spread with time. For the erfc solution, the diffusion front at the arbitrary 
concentration of C(jc, t)/C 0 = 1/2 moves parabolically with time as x = 
2 y/Dterfc~ 1 (\/2) or x = 0.96When \fDt becomes large compared 
with the film dimensions, the assumption of an infinite matrix is not valid and 
the solutions do not strictly hold. The film properties may also change 
appreciably due to interdiffusion. To limit the latter and ensure the integrity of 
films, D should be kept small, which in effect means the maintenance of low 
temperatures. This subject will be discussed further in Chapter 8. 


1.6.2. Atomistic Considerations 

Macroscopic changes in composition during diffusion are the result of the 
random motion of countless individual atoms unaware of the concentration 
gradient they have helped establish. On a microscopic level, it is sufficient to 
explain how atoms execute individual jumps from one lattice site to another, 
for through countless repetitions of unit jumps macroscopic changes occur. 
Consider Fig. l-16a, showing neighboring lattice planes spaced a distance a 0 
apart within a region where an atomic concentration gradient exists. If there 
are h, atoms per unit area of plane 1, then at plane 2, n 2 = + (dn /dx)a 0 . 
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Figure 1-16. (a) Atomic diffusion fluxes between neighboring crystal planes, (b) 

Atomistic view of atom jumping into a neighboring vacancy. 
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where we have taken the liberty of assigning a continuum behavior at discrete 
planes. Each atom vibrates about its equilibrium position with a characteristic 
lattice frequency v , typically 10 13 sec _1 . Very few vibrational cycles have 
sufficient amplitude to cause the atom to actually jump into an adjoining lattice 
position, thus executing a direct atomic interchange. This process would be 
greatly encouraged, however, if there were neighboring vacant sites. The 
fraction of vacant lattice sites was previously given by e~ £//kT (see Eq. 1-3). 
In addition, the diffusing atom must acquire sufficient energy to push the 
surrounding atoms apart so that it can squeeze past and land in the so-called 
activated state shown in Fig. 1-16b. This step is the precursor to the downhill 
jump of the atom into the vacancy. The number of times per second that an 
atom successfully reaches the activated state is ve~ £j/kT , where is the 
vacancy jump or migration energy per atom. Here the Boltzmann factor may 
be interpreted as the fraction of all sites in the crystal that have an activated 
state configuration. The atom fluxes from plane 1 to 2 and from plane 2 to 1 
are then, respectively, given as 

J \~2 = ^"exp - ~exp - ^( Ca o). (l-29a) 

1 Ef E: ( dC \ 

J 2-, = --exp- -exp- -(C+ — «„)»„. O-Mb) 


where we have substituted Ca 0 for n and used the factor of 1 /6 to account for 
bidirectional jumping in each of the three coordinate directions. The net flux 
J N is the difference or 


1 

6 


9 E f E j 

j n= ~ T^exp - —exp ~ Jf 



(1-30) 


By association with Fick’s law, D can be expressed as 

D = D 0 exp - E d /RT (1-31) 

with D 0 = (l/6)ao*' anc * E d ~ (e f + £j)N a% where N A is Avogadro’s num¬ 
ber. 

Although the above model is intended for atomistic diffusion in the bulk 
lattice, a similar expression for D would hold for transport through grain 
boundaries or along surfaces and interfaces of films. At such nonlattice sites, 
energies for defect formation and motion are expected to be less, leading to 
higher diffusivities. Dominating microscopic mass transport is the Boltzmann 
factor exp - E D /RT , which is ubiquitous when describing the temperature 
dependence of the rate of many processes in thin films. In such cases the 
kinetics can be described graphically by an Arrhenius plot in which the 



38 


A Review of Materials Science 


a. 




Figure 1-17. (a) Free-energy variation with atomic distance in the absence of an 

applied field, (b) Free-energy variation with atomic distance in the presence of an 
applied field. 


logarithm of the rate is plotted on the ordinate and the reciprocal of the 
absolute temperature is plotted along the abscissa. The slope of the resulting 
line is then equal to -E D /R f from which the characteristic activation energy 
can be extracted. 

The discussion to this point is applicable to motion of both impurity and 
matrix atoms. In the latter case we speak of self-diffusion. For matrix atoms 
there are driving forces other than concentration gradients that often result in 
transport of matter. Examples are forces due to stress fields, electric fields, 
and interfacial energy gradients. To visualize their effect, consider neighboring 
atomic positions in a crystalline solid where no fields are applied. The free 
energy of the system has the periodicity of the lattice and varies schematically, 
as shown in Fig. l-17a. Imposition of an external field now biases the system 
such that the free energy is lower in site 2 relative to 1 by an amount 2 AG. A 
free-energy gradient exists in the system that lowers the energy barrier to 
motion from 1 2 and raises it from 2 1. The rate at which atoms move 

from 1 to 2 is given by 


G d - AG 


sec 


r 12 = v exp 


RT 


(l-32a) 
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Similarly, 


/ G d + AG 

r2 ' = " exp l— nr- 


sec 


(l-32b) 


and the net rate r N is given by the difference or 


r N = v exp 


~RT 


AG 


AG 


exp-exp- 

F RT y RT 


AG 


= 2 " eXP “ -Rf Sinh ~RT' ^' 33 ^ 


When AG = 0, the system is in thermodynamic equilibrium and r N = 0, so 
no net atomic motion occurs. Although G D is typically a few electron volts or 
so per atom (1 eV = 23,060 cal/mole), AG is much smaller in magnitude 
since it is virtually impossible to impose external forces on solids comparable 
to the interatomic forces. In fact, AG/RT is usually much less than unity, so 
sinh AG/RT * AG/RT. This leads to commonly observed linear diffusion 
effects. But when AG/RT * 1 ? nonlinear diffusion effects are possible. By 
multiplying both sides of Eq. 1-33 by a 0 , we obtain the atomic velocity v: 

. = (1-34) 

a 0 KJ 


The term in brackets is essentially the diffusivity D with G D a diffusional 
activation energy. (The distinction between G D and E D need not concern us 
here.) The term 2 A G/a 0 is a measure of the molar free-energy gradient or 
applied force F. Therefore, the celebrated Nernst-Einstein equation results: 

v = DF/RT. (1-35) 

Application of this equation will be made subsequently to various thin-film 
mass transport phenomena, e.g., electric-field-induced atomic migration (elec¬ 
tromigration), stress relaxation, and grain growth. The drift of charge carriers 
in semiconductors under an applied field can also be modeled by Eq. 1-35. In 
some instances, larger generalized forces can be applied to thin films relative 
to bulk materials because of the small dimensions involved. 

Chemical reaction rate theory provides a common application of the preced¬ 
ing ideas. In Fig. 1-18 the reactants at the left are envisioned to proceed 
toward the right following the reaction coordinate path. Along the way, 
intermediate activated states are accessed by surmounting the free-energy 
barrier. Through decomposition of the activated species, products form. If C R 
is the concentration of reactants at coordinate position 1 and C P the concentra¬ 
tion of products at 2, then the net rate of reaction is proportional to 

I G*\ I G* +AG \ , x 

r s * Q,expl - I ~ C/,exp| -^— J . (1-36) 
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COORDINATE 

Figure 1-18. Free-energy path for thermodynamically favored reaction 1 2. 


where G* is the free energy of activation. As before, the Boltzmann factors 
represent the probabilities of surmounting the respective energy barriers faced 
by reactants proceeding in the forward direction, or products in the reverse 
direction. When chemical equilibrium prevails, the competing rates are equal 
and r N = 0. Therefore, 

C p AG exp - G P /RT 

— = exp- = -. 

C R RT exp - G r /RT 

For the reaction to proceed to the right A G = G R — G P must be positive. 
By comparison with Eq. 1-12, it is apparent that the left-hand side is the 
equilibrium constant and AG may be associated with - AG°. This expression 
is perfectly general, however, and applies, for example, to electron energy-level 
populations in semiconductors and lasers, as well as magnetic moment distribu¬ 
tions in solids. In fact, whenever thermal energy is a source of activation 
energy, Eq. 1-37 is valid. 



1-7. Nucleation 


When the critical lines separating stable phase fields on equilibrium phase 
diagrams are crossed, new phases appear. Most frequently, a decrease in 
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temperature is involved, and this may, for example, trigger solidification or 
solid-state phase transformations from now unstable melts or solid matrices. 
When such a transformation occurs, a new phase of generally different 
structure and composition emerges from the prior parent phase or phases. The 
process known as nucleation occurs during the very early stages of phase 
change. It is important in thin films because the grain structure that ultimately 
develops in a given deposition process is usually strongly influenced by what 
happens during film nucleation and subsequent growth. 

Simple models of nucleation are first of all concerned with thermodynamic 
questions of the energetics of the process of forming a single stable nucleus. 
Once nucleation is possible, it is usual to try to specify how many such stable 
nuclei will form within the system per unit volume and per unit time—i.e., 
nucleation rate. As an example, consider the homogeneous nucleation of a 
spherical solid phase of radius r from a prior supersaturated vapor. Pure 
homogeneous nucleation is rare but easy to model since it occurs without 
benefit of complex heterogeneous sites such as exist on an accommodating 
substrate surface. In such a process the gas-to-solid transformation results in a 
reduction of the chemical free energy of the system given by ( 4 / 3 ) 7 rr 3 AG K , 
where A G v corresponds to the change in chemical free energy per unit 
volume. For the condensation reaction vapor (v) -► solid (s), Eq. 1-13 indi¬ 
cates that 

kT P s kT P v , 

A G v = —In— =-In—, (1-38) 

y Q P v fi P s y ’ 

where P s is the vapor pressure above the solid, P v is the pressure of the 

supersaturated vapor, and Q is the atomic volume. A more instructive way to 
write Eq. 1-38 is 

AG^ = — [kT /fi)ln(l + S), (1-39) 

where S is the vapor supersaturation defined by (P v - P s )/P s . Without 
supersaturation, AG K is zero and nucleation is impossible. In our example, 
however, P v > P s and A G v is negative, which is consistent with the notion of 
energy reduction. Simultaneously, new surfaces and interfaces form. This 
results in an increase in the surface free energy of the system given by 47 rr 2 7 , 
where 7 is the surface energy per unit area. The total free-energy change in 
forming the nucleus is thus given by 

AG = (4/3)7rr 3 AG K + 47r/* 2 7, (1^40) 

and minimization of AG with respect to r yields the equilibrium size of 
r = r*. Thus, dAG/dr = 0, and r * = -2y/AG V . Substitution in Eq. \-40 
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Figure 1-19. Free-energy change (AG) as a function of cluster (r* > r) or stable 
nucleus (r > r*) size, r* is critical nucleus size, and AG* is critical free-energy 
barrier for nucleation. 

gives AG* = 16 tt7 3 /3(A G v ) 2 . The quantities r* and AG* are shown in Fig. 
1-19, where it is evident that AG* represents an energy barrier to the 
nucleation process. If a solid-like spherical cluster of atoms momentarily forms 
by some thermodynamic fluctuation, but with radius less than r*, the cluster is 
unstable and will shrink by losing atoms. Clusters larger than r* have sur¬ 
mounted the nucleation energy barrier and are stable. They tend to grow 
larger while lowering the energy of the system. 

The nucleation rate N is essentially proportional to the product of three 
terms, namely, 

N = N*A *co (nuclei/cm 2 -sec). (l-4l) 

TV* is the equilibrium concentration (per cm 2 ) of stable nuclei, and co is the 
rate at which atoms impinge (per cm 2 -sec) onto the nuclei of critical area A *. 
Based on previous experience of associating the probable concentration of an 
entity with its characteristic energy through a Boltzmann factor, it is appropri¬ 
ate to take TV* = n s e~ AG * /kT , where n s is the density of all possible nuclea¬ 
tion sites. The atom impingement flux is equal to the product of the concentra¬ 
tion of vapor atoms and the velocity with which they strike the nucleus. In the 
next chapter we show that this flux is given by a(P v — P S )N A / VlirMRT, 
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where M is the atomic weight and a is the sticking coefficient. The nucleus 
area is simply 47rr 2 , since gas atoms impinge over the entire spherical surface. 
Upon combining terms, we obtain 


N 


« s exP - 


AG*^ 2 a(P v -P s )N A 
-47r r . =— 

kT V2 ttMRT 


(1-42) 


The most influential term in this expression is the exponential factor. It 
contains AG*, which is, in turn, ultimately a function of S. When the vapor 
supersaturation is sufficiently large, homogeneous nucleation in the gas is 
possible. This phenomenon causes one of the more troublesome problems 
associated with chemical vapor deposition processes since the solid particles 
that nucleate settle on and are incorporated into growing films destroying their 
integrity. 

Heterogeneous nucleation of films is a more complicated subject in view of 
the added interactions between deposit and substrate. The nucleation sites in 
this case are kinks, ledges, dislocations, etc., which serve to stabilize nuclei of 
differing size. The preceding capillarity theory will be used again in Chapter 5 
to model heterogeneous nucleation processes. Suffice it to say that when N is 
high during deposition, many crystallites will nucleate and a fine-grained film 
results. On the other hand, if nucleation is suppressed, conditions favorable to 
single-crystal growth are fostered. 


1.8. Conclusion 


At this point we conclude this introductory sweep through several relevant 
topics in materials science. If the treatment of structure, bonding, thermody¬ 
namics, and kinetics has introduced the reader to or elevated his or her prior 
awareness of these topics, it has served the intended purpose. Threads of this 
chapter will be woven into the subsequent fabric of the discussion on the 
preparation and properties of thin films. 



Exercises 



1. An FCC film is deposited on the (100) plane of a single-crystal FCC 
substrate. It is determined that the angle between the [100] directions in 
the film and substrate is 63.4°. What are the Miller indices of the plane 
lying in the film surface? 
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2 . Both Au, which is FCC, and W, which is body-centered cubic (BCC) 
have a density of 19.3 g/cm 3 . Their respective atomic weights are 197.0 
and 183.9. 

a. What is the lattice parameter of each metal? 

b. Assuming both contain hard sphere atoms, what is the ratio of their 
diameters? 

3 - a. Comment on the thermodynamic stability of a thin-film superlattice 
composite consisting of alternating Si and Ge 0 4 Si 0 6 film layers shown 
in Fig. 14-17 given the Ge-Si phase diagram (Fig. 1-12). 
b. Speculate on whether the composite is a single phase (because it is a 
single crystal) or consists of two phases (because there are visible film 
interfaces). 

_ o 

4. Diffraction of 1.5406-A X-rays from a crystallographically oriented 
(epitaxial) relaxed bilayer consisting of AJAs and GaAs yields two closely 
spaced overlapping peaks. The peaks are due to the (111) reflections from 

o 

both films. The lattice parameters are a 0 (AlAs) = 5.6611 A and 

o 

tf 0 (GaAs) = 5.6537 A. What is the peak separation in degrees? 

5. The potential energy of interaction between atoms in an ionic solid as a 
function of separation distance is given by V{r) = - A/r+Br ' n , 
where A, B y and n are constants. 

a. Derive a relation between the equilibrium lattice distance a 0 and A , 
B y and n. 

b. The force constant between atoms is given by K s = d 2 V / dr 2 | r . 
If Young’s elastic modulus (in units of force/area) is essentially given 
by K s /a 0 , show that it varies as a q 4 in ionic solids. 

6 . What is the connection between the representations of electron energy in 
Figs. l-8a and 1-9? Illustrate for the case of an insulator. If the material 
in Fig. l-8a were compressed, how would E g change? Would the 
electrical conductivity change? How? 

7- A 75 8Lt% Ga-25 at% As melt is cooled from 1200 °C to 0 °C in a 
crucible. 

a. Perform a complete phase analysis of the crucible contents at 1200 °C, 
1000 °C, 600 °C, 200 °C, 30 °C, and 29 °C. What phases are 
present? What are their chemical compositions, and what are the 
relative amounts of these phases? Assume equilibrium cooling. 
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b. A thermocouple immersed in the melt records the temperature as the 
crucible cools. Sketch the expected temperature-time cooling re¬ 
sponse. 

c. Do a complete phase analysis for a 75 at% As-25 at% Ga melt at 1000 
°C, 800 °C, and 600 °C. 

8 . A quartz (Si0 2 ) crucible is used to contain Mg during thermal evapora¬ 
tion in an effort to deposit Mg thin films. Is this a wise choice of crucible 
material? Why? 

9- A solar cell is fabricated by diffusing phosphorous (N dopant) from a 
constant surface source of 10 20 atoms/cm 3 into a P-type Si wafer 
containing 10 16 B atoms/cm 3 . The diffusivity of phosphorous is 10“ 12 
cm 2 /sec, and the diffusion time is 1 hour. How far from the surface is the 
junction depth—i.e., where Cn — 

10 . A brass thin film of thickness d contains 30 wt% Zn in solid solution 
within Cu. Since Zn is a volatile species, it readily evaporates from the 
free surface (x = d) at elevated temperature but is blocked at the 
substrate interface, x = 0. 

a. Write boundary conditions for the Zn concentration at both film 
surfaces. 

b. Sketch a time sequence of the expected Zn concentration profiles 
across the film during dezincification. (Do not solve mathematically.) 

11 - Measurements on the electrical resistivity of Au films reveal a three- 
order-of-magnitude reduction in the equilibrium vacancy concentration as 
the temperature drops from 600 to 300 °C. 

a. What is the vacancy formation energy? 

b. What fraction of sites will be vacant at 1080 °C? 

12. During the formation of Si0 2 for optical fiber fabrication, soot particles 
500 A in size nucleate homogeneously in the vapor phase at 1200 °C. If 
the surface energy of Si0 2 is 1000 ergs/cm 2 , estimate the value of the 
supersaturation present. 

13. An ancient recipe for gilding bronze statuary alloyed with small amounts 
of gold calls for the following surface modification steps. 

(1) Dissolve surface layers of the statue by applying weak acids (e.g., 
vinegar). 
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(2) After washing and drying, heat the surface to as high a temperature as 
possible but not to the point where the statue deforms or is damaged. 

(3) Repeat step 1. 

(4) Repeat step 2. 

(5) Repeat this cycle until the surface attains the desired golden appear¬ 
ance. 

Explain the chemical and physical basis underlying this method of 
gilding. 
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Vacuum Science 
and Technology 


Virtually all thin-film deposition and processing methods as well as techniques 
employed to characterize and measure the properties of films require a vacuum 
or some sort of reduced-pressure environment. For this reason the relevant 
aspects of vacuum science and technology are discussed at this point. It is also 
appropriate in a broader sense because this subject matter is among the most 
undeservedly neglected in the training of scientists and engineers. This is 
surprising in view of the broad interdisciplinary implications of the subject and 
the ubiquitous use of vacuum in all areas of scientific research and technologi¬ 
cal endeavor. The topics treated in this chapter will, therefore, deal with: 

2.1. Kinetic Theory of Gases 

2.2. Gas Transport and Pumping 

2.3. Vacuum Pumps and Systems 


2.1. Kinetic Theory of Gases 


2.1.1. Molecular Velocities 

The well-known kinetic theory of gases provides us with an atomistic picture 
of the state of affairs in a confined gas (Refs. 1, 2). A fundamental assumption 
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is that the large number of atoms or molecules of the gas are in a continuous 
state of random motion, which is intimately dependent on the temperature of 
the gas. During their motion the gas particles collide with each other as well as 
with the walls of the confining vessel. Just how many molecule-molecule or 
molecule-wall impacts occur depends on the concentration or pressure of the 
gas. In the perfect or ideal gas approximation, there are no attractive or 
repulsive forces between molecules. Rather, they may be considered to behave 
like independent elastic spheres separated from each other by distances that are 
large compared with their size. The net result of the continual elastic collisions 
and exchange of kinetic energy is that a steady-state distribution of molecular 
velocities emerges given by the celebrated Maxwell-Boltzmann formula 


1 dn 

m = ~~r 

n dv 


4 


M 

2RT 


3/2 

1 u 2 exp - 


Mv 2 

Trt’ 


( 2 - 1 ) 


This centerpiece of the kinetic theory of gases states that the fractional number 
of molecules /(u), where n is the number per unit volume in the velocity 
range i> to u + dv, is related to their molecular weight (M) and absolute 
temperature (7). In this formula the units of the gas constant R are on a 
per-mole basis. 

Among the important implications of Eq. 2-1, which is shown plotted in Fig. 
2-1, is that molecules can have neither zero nor infinite velocity. Rather, the 
most probable molecular velocity in the distribution is realized at the maximum 



Figure 2-1. Velocity distributions for A1 vapor and H 2 gas. (Reprinted with permis¬ 
sion from Ref. 1). 







2.1. Kinetic Theory of Gases 


51 


value of f{v) and can be calculated from the condition that df{v)/dv = 0. 
Since the net velocity is always the resultant of three rectilinear components 
u*, v y , and u z , one or even two, but of course not all three, of these may be 
zero simultaneously. Therefore, a similar distribution function of molecular 
velocities in each of the component directions can be defined; i.e., 


/K) 


}_dn , 

n dv x 


M 1/2 Mvl 

- exp-, 

2ttRT v 2 RT 


( 2 - 2 ) 


and similarly for the y and z components. 

A number of important results emerge as a consequence of the foregoing 
equations. For example, the most probable (u m ), average (u), and mean square 
(v 2 ) velocities are given, respectively, by 


= 


2 RT 


M 


f = 



8 RT 
irM 


(2-3a) 


(2-3b) 


V 


2 





(2-3c) 


These velocities, which are noted in Fig. 2-1, simply depend on the 
molecular weight of the gas and the temperature. In air at 300 K, for example, 
the average molecular velocity is 4.6 X 10 4 cm/sec, which is almost 1030 
miles per hour. However, the kinetic energy of any collection of gas mole¬ 
cules is_solely dependent on temperature. For a mole quantity it is given by 
(1/2 )Mv 2 = (3/2 )RT with (1/2 )RT partitioned in each of the coordinate 
directions. 


2.1.2. Pressure 

Momentum transfer from the gas molecules to the container walls gives rise to 
the forces that sustain the pressure in the system. Kinetic theory shows that the 
gas pressure P is related to the mean-square velocity of the molecules and. 
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thus, alternatively to their kinetic energy or temperature. Thus, 


P = 


1 nM— 


3 N a 


nRT 


(2-4) 


where N A is Avogadro’s number. From the definition of n it is apparent that 
Eq. 2-4 is also an expression for the perfect gas law. Pressure is the most 
widely quoted system variable in vacuum technology, and this fact has 
generated a large number of units that have been used to define it under 
various circumstances. Basically, two broad types of pressure units have arisen 
in practice. In what we shall call the scientific system (or coherent unit system 
(Ref. 2)), pressure is defined as the rate of change of the normal component of 
momentum of impinging molecules per unit area of surface. Thus, the pressure 
is normally defined as a force per unit area, and examples of these units are 
dynes/cm 2 (CGS) or newtons/meter 2 (N/m 2 ) (MKS). Vacuum levels are now 
commonly reported in SI units or pascals; 1 pascal (Pa) = 1 N/m 2 . Histori¬ 
cally, however, pressure was, and still is, measured by the height of a column 
of liquid, e.g., Hg or H 2 0. This has led to a set of what we shall call practical 
or noncoherent units such as millimeters and microns of Hg, torr, atmo¬ 
spheres, etc., which are still widely employed by practitioners as well as by 
equipment manufacturers. Definitions of some units together with important 
conversions include 

1 atm = 1.013 X 10 6 dynes/cm 2 = 1.013 X 10 5 N/m 2 = 1.013 x 10 5 Pa 

1 torr = 1 mm Hg = 1.333 X 10 3 dynes/cm 2 = 133.3 N/m 2 = 133.3 Pa 

1 bar = 0.987 atm = 750 torr. 

The mean distance traveled by molecules between successive collisions, 
called the mean-free path X^, is an important property of the gas that 
depends on the pressure. To calculate X^, we note that each molecule 
presents a target area ird 2 c to others, where d c is its collision diameter. A 
binary collision occurs each time the center of one molecule approaches within 
a distance d c of the other. If we imagine the diameter of one molecule 
increased to 2 d c while the other molecules are reduced to points, then in 
traveling a distance X^ the former sweeps out a cylindrical volume 7T£/ 2 X m ^. 
One collision will occur under the conditions ird^X^^n = 1. For air at room 
temperature and atmospheric pressure, X^ * 500 A, assuming d c « 5 A. 

A molecule collides in a time given by \ mfp / v and under the previous 
conditions, air molecules make about 10 10 collisions per second. This is why 
gases mix together rather slowly even though the individual molecules are 
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moving at great speeds. The gas particles do not travel in uninterrupted linear 
trajectories. As a result of collisions, they are continually knocked to and fro, 
executing a zigzag motion and accomplishing little net movement. Since n is 
directly proportional to P, a simple relation for ambient air is 

\n fp * 5 X 1 0~ 3 /P, (2-5) 

with given in cm and P in torr. At pressures below 10" 3 torr, is so 
large that molecules effectively collide only with the walls of the vessel. 


2.1.3. Gas Impingement on Surfaces 

A most important quantity that plays a role in both vacuum science and vapor 
deposition is the gas impingement flux <t>. It is a measure of the frequency with 
which molecules impinge on, or collide with, a surface, and should be 
distinguished from the previously discussed molecular collisions in the gas 
phase. The number of molecules that strike an element of surface (perpendicu¬ 
lar to a coordinate direction) per unit time and area is given by 


$ = 



( 2 - 6 ) 


Upon substitution of Eq. 2-2, we get 
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<f> = - 

(2tt) 
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~RT 


1/2 
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^exp - 


Mvl / RT 

-- dv x = n\ - . 

2 RT * V 2 irM 


(2-7) 


The use of Eq. 2-3b yields = (1/4 )nv, and substitution of the perfect gas 
law (Eq. 2-4) converts into the more recognizable form 

$/AT a = Pj VlTtMRT moles/cm 2 -sec. (2-8) 

A useful expansion of this formula is 

P 

$ = 3.513 X 10 22 —^===- molecules/cm 2 -sec, (2-9) 

when P is expressed in torr. 

As an application of the foregoing development, consider the problem of gas 
escaping the vessel through a hole of area A into a region where the gas 
concentration is zero. The rate at which molecules leave is given by and 
this corresponds to a volume flow per second (V) given by <PA /n cm 3 /sec. 
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Upon substitution of Eqs. 2-4 and 2-9, we have 


K= 3.64 X 10 3 (T/M ) 1/2 A cm 3 /sec. (2-10) 

For air at 298 K this corresponds to 11.7^4 L/sec, where A has units of cm 2 . 
In essence, we have just calculated what is known as the conductance of a 
circular aperture, a quantity that will be utilized later because of its signifi¬ 
cance in pumping gases. 

As a second application, consider the question of how long it takes for a 
surface to be coated by and contaminated with a monolayer of gas molecules. 
This issue is highly important when one attempts to deposit or grow films 
under extremely clean conditions. The same concern arises during surface 
analysis of films performed at very low pressures to minimize surface contami¬ 
nation arising from the vacuum chamber environment. Here one must make 
certain that the analysis time is shorter than that required for impurities to 
accumulate. This characteristic contamination time t c is essentially the inverse 
of the impingement flux. Thus, for complete monolayer coverage of a surface 
containing some 10 15 atoms/cm 2 , the use of Eq. 2-9 yields 


10 ' 


(MT) 


1/2 


= 2.85 x 10 


( mtY' 2 


/,-» A 1 \ 


with P measured in torr. In air at atmospheric pressure and ambient tempera¬ 
ture a surface will acquire a monolayer of gas in 3.49 x 10“ 9 sec, assuming 
all impinging atoms stick. On the other hand, at 10“ 10 torr a surface will stay 
clean for about 7.3 h. 

A condensed summary of the way system pressure affects the gas density, 
mean-free path, incidence rates, and monolayer formation times is conve¬ 
niently displayed in Fig. 2-2. The pressure range spanned in all thin-film 
research, development, and technological activities discussed in this book is 
over 13 orders of magnitude. Since all of the quantities depicted vary directly 
(on the left-hand axis) or inversely (on the right-hand axis) with pressure, a 
log-log plot connecting the variables is linear with a slope of 1. The pressure 
scale is arbitrarily subdivided into corresponding low-, medium-, high-, and 
ultrahigh-vacuum domains, each characterized by different requirements with 
respect to vacuum hardware (e.g., pumps, gauges, valves, gaskets, feed¬ 
throughs, etc.). Of the film deposition processes, evaporation requires a 
vacuum between the high and ultrahigh regimes, whereas sputtering and 
low-pressure chemical vapor deposition are accomplished at the border be¬ 
tween the medium- and high-vacuum ranges. Of the analytical instruments, 
electron microscopes operate in high vacuum, and surface analytical equipment 
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Figure 2-2. Molecular density, incidence rate, mean-free path, and monolayer for¬ 
mation time as a function of pressure. (Reprinted with permission from Ref. 2). 

have the most stringent cleanliness requirements and are operative only under 
ultrahigh-vacuum conditions. 


2 . 2 . Gas Transport and Pumping 


2.2.1. Gas Flow Regimes 

In order to better design, modify, or appreciate reduced-pressure systems, one 
must understand concepts of gas flow (Refs. 2-4). An incomplete understand¬ 
ing of gas flow limitations frequently results in less efficient system perfor¬ 
mance as well as increased expense. For example, a cheap piece of tubing 
having the same length and diameter dimensions as the diffusion pump to 
which it is attached will cut the pumping speed of the latter to approximately 
half of its rated value. In addition to the effectively higher pump cost, a 
continuing legacy of such a combination will be the longer required pumping 
time to reach a given level of vacuum each time the system is operated. 

Whenever there is a net directed movement of gas in a system under the 
influence of attached pumps, gas flow is said to occur. Under such conditions 
the gas experiences a pressure drop. The previous discussion on kinetic theory 
of gases essentially assumed an isolated sealed system. Although gas molecules 
certainly move, and with high velocity at that, there is no net gas flow and no 
establishment of pressure gradients in such a system. When gas does flow, 
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however, it is appropriate to distinguish between different regimes of flow. 
These regimes depend on the geometry of the system involved as well as the 
pressure, temperature, and type of gas under consideration. At one extreme we 
have free molecular flow, which occurs at low gas densities. The chambers of 
high-vacuum evaporators and analytical equipment, such as Auger electron 
spectrometers and electron microscopes, operate within the molecular flow 
regime. Here the mean-free path between intermolecular collisions is large 
compared with the dimensions of the system. Kinetic theory provides an 
accurate picture of molecular motion under such conditions. At higher pres¬ 
sures the mean-free path is reduced, and successive intermolecular collisions 
predominate relative to collisions with the walls of the chamber. At this 
extreme the so-called viscous flow regime is operative. An important example 
of such flow occurs in atmospheric chemical vapor deposition reactors. Com¬ 
pared with molecular flow, viscous flow is quite complex. At low gas 
velocities the flow is laminar where layered, parallel flow lines may be 
imagined. At the walls of a tube, for example, the laminar flow velocity is 
zero, but it increases to a maximum at the axis. At higher velocity the flowing 
gas layers are no longer parallel but swirl and are influenced by any obstacles 
in the way. In this turbulent flow range, cavities of lower pressure develop 
between layers. More will be said about viscous flow in Chapter 4. 

Criteria for distinguishing between the flow regimes are based on the 
magnitude of the Knudsen number Kn , which is defined by the ratio of the 



PRESSURE {Torr) 

Figure 2-3. Dominant gas flow regimes as a function of system dimensions and 
pressure. 
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chamber (pipe) diameter D p to gas mean-free path, i.e., Kn = D^/X^*. 
Thus for 

molecular flow Kn < 1, (2-12a) 

intermediate flow 1 < AT/i < 110, (2-12b) 

viscous flow Kn > 110. (2-12c) 

In air these limits can be alternatively expressed by D p P < 5 x 10“ 3 cm-torr 
for molecular flow, and D p P > 5 X 10“ 1 cm-torr in the case of viscous flow 
through the use of Eq. 2-5. Figure 2-3 serves to map the dominant flow 
regimes on this basis. Note that flow mechanisms may differ in various parts of 
the same system. Thus, although molecular flow will occur in the high-vacuum 
chamber, the gas may flow viscously in the piping near the exhaust pumps. 

2.2.2. Conductance 

Let us reconsider the molecular flow of gas through an orifice of area A that 
now separates two large chambers maintained at low pressures, P, and P 2 . 
From a phenomenological standpoint, a flow driven by the pressure difference 
is expected; i.e., 

Q=C(P,-P 2 ). (2-13) 

Here Q is defined as the gas throughput with units of pressure x volume per 
second (e.g., torr-L/sec). The constant of proportionality C is known as the 
conductance and has units of L/sec. Alternately, viewing flow through the 
orifice in terms of kinetic theory, we note that the molecular impingements in 
each of the two opposing directions do not interfere with each other. There¬ 
fore, the net gas flow at the orifice plane is given by the difference, or 
(*, ~ * 2 )A. 

Through the use of Eq. 2-10 it is easily shown that the conductance of the 
orifice is 

C = 3M^T/M A or 11.7/1 L/sec 
for air at 298 K, Note in the choice of terms the analogy to electrical circuits. 
If P, - P 2 is associated with the voltage difference, Q may be viewed as a 
current. Conductances of other components where the gas flow is in the 
molecular regime can be similarly calculated or measured. Results for a 
number of important geometric shapes are given in Fig. 2A (Ref. 5). Note that 
C is simply a function of the geometry for a specific gas at a given 
temperature. This is not true of viscous flow, where C also depends on 


Note that Kn is also defined as X^/ D p in the literature. 
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Figure 2-4. Conductances of various geometric shapes for molecular flow of air at 
25°C. Units of C are L/sec. (Reprinted with permission from Ref. 5). 


pressure. When conductances are joined in series, the system conductance C sys 
is given by 



(2-14) 


Clearly C sys 
in parallel 


is lower than that of any individual conductance. When connected 
c sys = C, + c 2 + C 3 + • • • . (2-15) 


As an example, (Ref. 6), consider the conductance of the cold trap assembly 
of Fig. 2-5 that isolates a vacuum system above from the pump below. 
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Figure 2-5. Cold trap assembly. (Adapted from R. W. Roberts, An Outline of 
Vacuum Technology , G.E. Report No. 64-RL-3394C, 1964, with permission from 
General Electric Company). 

Contributions to the total conductance come from 
C\ = conductance of aperture of 10 cm diameter 

= U.1A = 11.7ir(5) 2 = 919 L/sec 
C 2 = conductance of pipe 3 cm long 

= 12.2 D 3 /L = 12.2(l0) 3 /3 = 4065 L/sec 
C 3 - conductance of annular aperture 

= 11.7/1 ann = 11.7(0.25)ir(10 2 - 8 2 ) = 331 L/sec 
C 4 = conductance of annular pipe 

. (D 2 - D i ) 2 (D ] + D 2 ) 12.2(10 - 8) 2 (10 + 8) _ _ , 

= 12.2- = - = 58.6 L/sec 

L 15 

C 5 = C 2 - 4065 L/sec 

C 6 = conductance of aperture in end of pipe/diffusion pump 
AA 0 11.7ir(2.5) 2 7r5 2 

A - A 0 7r(5 2 - (2.5) 2 ) 


= 11.7 


= 303 L/sec 
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Therefore, 

1 A 1 

—- = L, — and C Total = 40 L/sec 

'-'Total 

upon evaluation. Strictly speaking, C 3 and C 4 should be multiplied by a 
correction factor of 1.27, which would have the effect of increasing C Tota , to 
51.1 L/sec. As we shall soon see, it is always desirable to have as large a 
conductance as possible. Clearly, the overall conductance is severely limited in 
this case by the annular region between the concentric pipes. 

2.2.3. Pumping Speed 

Pumping is the process of removing gas molecules from the system through the 
action of pumps. The pumping speed 5 is defined as the volume of gas passing 
the plane of the inlet port per unit time when the pressure at the pump inlet is 
P. Thus, 

S = Q/P. (2-16) 

Although the throughput Q can be measured at any plane in the system, P and 
S refer to quantities measured at the pump inlet. 

Although conductance and pumping speed have the same units and may even 
be equivalent numerically, they have different physical meanings. Conductance 
implies a component of a given geometry across which a pressure differential 
exists. Pumping speed refers to any plane that may be considered to be a pump 
for preceding portions of the system. To apply these ideas, consider a pipe of 
conductance C connecting a chamber at pressure P to a pump at pressure P p 
as shown in Fig. 2-6a. Therefore, Q = C(P - P p ). Elimination of Q through 



Figure 2-6. Chamber-pipe-pump assembly: (a) no outgassing; (b) with outgassing. 
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the use of Eq. 2-16 yields 


S = 


1 + S P /C’ 


(2-17) 


where S p is the intrinsic speed at the pump inlet (S p = Q/P p ) and S is the 
effective pumping speed at the base of the chamber. The latter never exceeds 
S p or C and is, in fact, limited by the smaller of these quantities. If, for 

example, C = S p in magnitude, then S = S p /2 and the effective pumping 

speed is half the rated value for the pump. The lesson, therefore, is to keep 
conductances large by making ducts between the pump and chamber as short 
and wide as possible. 

Real pumps outgas or release gas into the system as shown in Fig. 2-6b. 
Account may be taken of this by including an oppositely directed extra 
throughput term Q p such that 

I O n \ 

Q = S p P - Q p = SpPl 1 - I. (2-18) 

When Q = 0, the ultimate pressure of the pump, P 0 , is reached and Q p = 

S p P 0 . The effective pumping speed is then 

S = Q/P = S p (l-P 0 /P), (2-19) 


and falls to zero as the ultimate pressure of the pump is reached. 

An important issue in vacuum systems is the time required to achieve a 
given pressure. The pump-down time can be calculated by noting that the 
throughput may be defined as the time (t) derivative of the product of volume 
and pressure; i.e., Q = -d(VP)/dt = - V{dP/dt). Employing Eq. 2-18, 
we write 


- V 


dP{t) 

dt 


= S p P-Q p , 


where Q p includes pump as well as chamber outgassing. Upon integration 


P(t)-Q p /S p P(t)-P 0 S p t 

--- = - = exp-, 

Pi-Q,/s p Pi-Po V 


( 2 - 20 ) 


where it is assumed that initially P = P { . During pump-down the pressure thus 
exponentially decays to P 0 with time constant given by V/S p . At high 
pressures where viscous flow is involved, S p is a function of P, and, 
therefore, Eq. 2-10 is not strictly applicable in such cases. 
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2 . 3 . Vacuum Pumps and Systems 


2.3.1. Pumps 

The vacuum systems employed to deposit and characterize thin films contain 
an assortment of pumps, tubing, valves, and gauges to establish and measure 
the required reduced pressures (Ref. 7). Of these components pumps are 
generally the most important, and only they will be discussed at any length. 
Vacuum pumps may be divided into two broad categories: gas transfer pumps 
and entrapment pumps. Gas transfer pumps remove gas molecules from the 
pumped volume and convey them to the ambient in one or more stages of 
compression. Entrapment pumps condense or chemically bind molecules at 
walls situated within the chamber being pumped. In contrast to gas transfer 
pumps, which remove gas permanently, some entrapment pumps are reversible 
and release trapped (condensed) gas back into the system upon warm-up. 

Gas transfer pumps may be further subdivided into positive-displacement 
and kinetic vacuum pumps. Rotary mechanical and Roots pumps are important 
examples of the positive-displacement variety. Diffusion and turbomolecular 
pumps are the outstanding examples of kinetic vacuum pumps. Among the 
entrapment pumps commonly employed are the adsorption, sputter-ion, and 
cryogenic pumps. Each pump is used singly or in combination in a variety of 
pumping system configurations. Pumps do not remove the gas molecules by 
exerting an attractive pull on them. The molecules are unaware that pumps 
exist. Rather, the action of pumps is to limit, interfere with, or alter natural 
molecular motion. We start this brief survey of some of the more important 
pumps with the positive-displacement types. 


2.3.1.1. Rotary Mechanical Pump. The rotary piston and rotary vane 
pumps are the two most common devices used to attain reduced pressure. In 
the rotary piston pump shown in Fig. 2-7a, gas is drawn into space A as the 
keyed shaft rotates the eccentric and piston. There the gas is isolated from the 
inlet after one revolution, then compressed and exhausted during the next 
cycle. Piston pumps are often employed to evacuate large systems and to back 
Roots blower pumps. 

The rotary vane pump contains an eccentrically mounted rotor with spring- 
loaded vanes. During rotation the vanes slide in and out within the cylindrical 
interior of the pump, enabling a quantity of gas to be confined, compressed, 
and discharged through an exhaust valve into the atmosphere. Compression 
ratios of up to 10 6 can be achieved in this way. Oil is employed as a sealant as 
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Figure 2-7. (a) Schematic of a rotary piston pump: 1. eccentric; 2. piston; 3. shaft; 
4. gas ballast; 5. cooling water inlet; 6. optional exhaust; 7. motor; 8. exhaust; 9. oil mist 
separator; 10. poppet valve; 11. inlet; 12. hinge bar; 13. casing; 14. cooling water outlet. 
(Courtesy of Stokes Vacuum Inc.) 


well as a lubricant between components moving within tight clearances of both 
types of rotary pumps. To calculate the pumping speed, let us assume that a 
volume of gas, V 0 (liters), is enclosed between the rotor and pump stator 
housing and swept into the atmosphere for each revolution of the rotor. The 
intrinsic speed of the pump will then be S p = V 0 f s , where f s is the rotor 
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(b) 

Figure 2-7. (b) Schematic of the Roots pump. 

speed in revolutions per second. Typical values of S p for vane pumps range 
from 1 to 300 L/sec, and from 10 to 500 L/sec for piston pumps. At elevated 
pressures the actual pumping speed S is constant but eventually becomes zero 
at the ultimate pump pressure in a manner suggested by Eq. 2-19. Single-stage 
vane pumps have an ultimate pressure of 10“ 2 torr, and two-stage pumps can 
reach 10' 4 torr. Rotary pumps are frequently used to produce the minimal 
vacuum required to operate both oil diffusion and turbomolecular pumps, 
which can then attain far lower pressures. 

2 . 3 . 7 . 2 . Roots Pump. An important variant of the positive-displacement 
pump is the Roots pump, shown in Fig. 2-7b, where two figure-eight-shaped 
lobes rotate in opposite directions against each other. The extremely close 
tolerances eliminate the need to seal with oil. These pumps have very high 
pumping speeds, and even though they can attain ultimate pressures below 
10“ 5 torr, a forepump (e.g., rotary mechanical) is required. Maximum 
pumping is achieved in the pressure range of 10' 3 to 20 torr, where speeds of 
up to several thousand liters per second can be attained. This combination of 
characteristics has made Roots pumps popular in low-pressure chemical vapor 
deposition (LPCVD) systems where large volumes of gas continuously pass 
through reactors maintained at — 1 torr. 

2.3.1.3. Diffusion Pump. In contrast to mechanical pumps, the diffusion 
pump shown in Fig. 2-8 has no moving parts. Diffusion pumps are designed to 
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(a) 


INLET 



(b) 

Figure 2-8. (a) Diffusion pump; (b) schematic of pump interior (Courtesy of Varian 
Associates, Vacuum Products Division) 
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operate in the molecular flow regime and can function over pressures ranging 
from well below 10' 10 torr to about 10~ 2 torn Because they cannot discharge 
directly into the atmosphere, a mechanical forepump is required to maintain an 
outlet pressure of about 0.1 torr. Since the pump inlet is essentially like the 
orifice of Fig. 2-4, a pumping speed of 11.7^4 L/sec would be theoretically 
expected for air at room temperature. Actual pumping speeds are typically only 
0.4 of this value. 

Diffusion pumps have been constructed with pumping speeds ranging from a 
few liters per second to over 20,000 L/sec. Pumping is achieved through the 
action of a fluid medium (typically silicone oil) that is boiled and vaporized in 
a multistage jet assembly. As the oil vapor stream emerges from the top 
nozzles, it collides with and imparts momentum to residual gas molecules, 
which happen to bound into the pump throat. These molecules are thus driven 
toward the bottom of the pump and compressed at the exit side where they are 
exhausted. A region of reduced gas pressure in the vicinity of the jet is 
produced, and more molecules from the high-vacuum side move into this zone, 
where the process is repeated. Several jets working in series serve to enhance 
the pumping action. 

A serious problem associated with diffusion pumps is the backstreaming of 
oil into the chamber. Such condensed oil can contaminate both substrate and 
deposit surfaces, leading to poor adhesion and degraded film properties. Oil 
vapor dissociated on contact with hot filaments or by electrical discharges also 
leaves carbonaceous or siliceous deposits that can cause electrical leakage or 
even high-voltage breakdown. For these reasons, diffusion pumps are not used 
in surface analytical equipment such as Auger electron and secondary ion mass 
spectrometers or in ultrahigh-vacuum deposition systems. Nevertheless, diffu¬ 
sion-pumped systems are widely used in nonelectronic (e.g., decorative, 
optical, tool) coating applications. To minimize backstreaming, attempts are 
made to condense the oil before it enters the high-vacuum chamber. Cold caps 
on top of the uppermost jet together with refrigerated traps and optically dense 
baffles are used for this purpose, but at the expense of somewhat reduced 
conductance and pumping speed. 


2.3.7.4. Turbomolecular Pump . The drive to achieve the benefits of oil¬ 
less pumping has spurred the development and use of turbomolecular pumps. 
Like the diffusion pump, the turbomolecular pump imparts a preferred direc¬ 
tion to molecular motion, but in this case the impulse is caused by impact with 
a rapidly whirling turbine rotor spinning at rates of 20,000 to 30,000 revolu¬ 
tions per minute. The turbomolecular pump of Fig. 2-9 is a vertical, axial flow 
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la) 



(b) 

Figure 2-9. (a) Turbomolecular pump; (b) schematic of pump interior (Courtesy of 
Varian Associates, Vacuum Products Division) 
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compressor consisting of many rotor-stator pairs or stages mounted in series. 
Gas captured by the upper stages is transferred to the lower stages, where it is 
successively compressed to the level of the fore-vacuum pressure. The maxi¬ 
mum compression varies linearly with the circumferential rotor speed, but 
exponentially with the square root of the molecular weight of the gas. Typical 
compression ratios for hydrocarbons, N 2 and H 2 , are 10 10 , 10 9 , and 10\ 
respectively. Since the partial pressure of a given gas specie on the low-pres¬ 
sure (i.e., vacuum chamber) side of the pump is equal to that on the 
high-pressure (exhaust) side divided by the compression ratio, only H 2 will fail 
to be pumped effectively. An important consequence of the very high compres¬ 
sion is that oil backstreaming is basically reduced to negligible levels. In fact, 
no traps or baffles are required, and the turbomolecular pump can be backed 
by a rotary pump and effectively achieve oil-less pumping. Turbomolecular 
pumps are expensive, but are increasingly employed in all sorts of thin-film 
deposition and characterization equipment. Typical characteristics include 
pumping speeds of 10 3 L/sec and ultimate pressures below 10 10 torr. 


2.3.7.5. Cryopumps. Cryopumps are capable of generating a very clean 
vacuum in the pressure range of 10 -3 to 10“ 10 torr. These are gas entrapment 
pumps, which rely on the condensation of vapor molecules on surfaces cooled 
below 120 K. Temperature-dependent van der Waals forces are responsible for 
physically binding or sorbing gas molecules. Several kinds of surfaces are 
employed to condense gas. These include (1) untreated bare metal surfaces, (2) 
a surface cooled to 20 K containing a layer of precondensed gas of higher 
boiling point (e.g., Ar or C0 2 for H 2 or He sorption), and (3) a microporous 
surface of very large area within molecular sieve materials, such as activated 
charcoal or zeolite. The latter are the working media of the common sorption 
pumps, which achieve forepressures of about 10“ 3 torr by surrounding a steel 
canister containing sorbent with a Dewar of liquid nitrogen. Cryopumps 
designed to achieve ultrahigh vacuum (Fig. 2-10) have panels that are cooled to 
20 K by closed-cycle refrigerators. These cryosurfaces cannot be directly 
exposed to the room-temperature surfaces of the chamber because of the 
radiant heat load, so they are surrounded by liquid-nitrogen-cooled shrouds. 

The starting or forepressure, ultimate pressure, and pumping speed of 
cryopumps are important characteristics. Cryopumps require an initial fore- 
pressure of about 10“ 3 torr in order to prevent a prohibitively large thermal 
load on the refrigerant and the accumulation of a thick condensate on the 
cryopanels. The ultimate pressure (P ull ) attained for a given gas is reached 
when the impingement rate on the cryosurface, maintained at temperature T. 
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Figure 2-10. (a) Cryopump; (b) schematic of pump interior: 1. fore-vacuum port; 2. 

temperature sensor; 3. 77 K shield; 4. 20 K condenser with activated charcoal; 5. port 
for gauge head and pressure relief valve; 6. cold head; 7. compressor unit; 8. helium 
supply and return lines; 9. electrical supply cable; 10. high-vacuum flange; 11. pump 
housing; 12. temperature measuring instrument. (Courtesy of Balzers, High Vacuum 
Products) 
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equals that on the vacuum chamber walls held at 300 K. Therefore from 
Eq. 2-8, 

P*-P'{T)y/mjT, (2-21) 

where P S (T) is the saturation pressure of the pumped gas. As an example, for 
N 2 at 20 K, the P s value is about 10’ 11 torr, so P ult ( N 2 ) = 3.9 x 10“" torr. 

Because of high vapor pressures at 20 K, H 2 , as well as He and Ne, cannot 
be effectively cryopumped. Of all high-vacuum pumps, cryopumps have the 
highest pumping speeds since they are limited only by the rate of gas 
impingement. Therefore, the pumping speed is given by Eq. 2-10, which for 
air at 20 K is equal to 3 L/sec for each square centimeter of cooled surface. 
Although they are expensive, cryopumps offer the versatility of serving as the 
main pump or, more frequently, acting in concert with other conventional 
pumps (e.g., turbopumps). They are, therefore, becoming increasingly popular 
in thin-film research and processing equipment. 


2 . 3 . 7 . 6 . Sputter-Ion Pumps . The last pump we consider is the sputter-ion 
pump shown in Fig. 2-11, which relics on sorption processes initiated by 
ionized gas to achieve pumping. The gas ions are generated in a cold cathode 
electrical discharge between stainless steel anode and titanium cathode arrays 
maintained at a potential difference of a few kilovolts. Electrons emitted from 
the cathode are trapped in the applied transverse magnetic field of a few 
thousand gauss, resulting in a cloud of high electron density (e.g., 10 13 
electrons/cm 3 ). After impact ionization of residual gas molecules, the ions 
travel to the cathode and knock out or sputter atoms of Ti. The latter deposit 
elsewhere within the pump, where they form films that getter or combine with 
reactive gases such as nitrogen, oxygen, and hydrogen. These gases and 
corresponding Ti compounds are then buried by fresh layers of sputtered 
metal. 

Similar pumping action occurs in the Ti sublimation pump, where Ti metal 
is thermally evaporated (sublimed) onto cryogenically cooled surfaces. A 
combination of physical cryopumping and chemical sorption processes then 
ensues. Sputter-ion pumps display a wide variation in pumping speeds for 
different gases. For example, hydrogen is pumped several times more effec¬ 
tively than oxygen, water, or nitrogen and several hundred times faster than 
argon. Unlike cryopumping action, the gases are permanently removed. These 
pumps are quite expensive and have a finite lifetime that varies inversely with 
the operating pressure. They have been commonly employed in oil-less ultra- 
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(a) 


CONTROL UNIT 



Figure 2-11- (a) Sputter-ion pump; (b) schematic of pump interior. (Courtesy of 
Varian Associates, Vacuum Products Division) 
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Figure 2-12. Schematic of vacuum deposition system. 


high (10“ 10 torr) vacuum deposition and surface analytical equipment, but are 
being supplanted by turbomolecular and cryopumps. 


2.3.2. Systems 

The broad variety of applications requiring a low-pressure environment is 
reflected in a corresponding diversity of vacuum system design. One such 
system shown in Fig. 2-12 is employed for vacuum evaporation of metals. The 
basic pumping system consists of a nominal 15-cm diameter, multistage 
oil-diffusion pump backed by a 17-cfm (8.0 L/sec) rotary mechanical pump. In 
order to sequentially coat batch lots, the upper chamber must be vented to air 
in order to load substrates. To minimize the pumping cycle time, however, we 
desire to operate the diffusion pump continuously, thus avoiding the wait 
involved in heating or cooling pump oil. This means that the pump must always 
view a vacuum of better than ~ 10' 1 torr above and be backed by a similar 
pressure at the exhaust. A dual vacuum-pumping circuit consisting of three 
valves, in addition to vent valves, is required to accomplish these ends. 

When starting cold, the high-vacuum and roughing valves are closed and the 
backing valve is open. Soon after the oil heats up, a high vacuum is reached 
above the diffusion pump. The backing valve is then closed, thus isolating the 
diffusion pump, and the roughing valve is opened, enabling the rotary pump to 
evacuate the chamber to a tolerable vacuum of about 10“ 1 torr. Finally, the 
roughing valve is shut, and both the backing and high-vacuum valves are 
opened, allowing the diffusion pump to bear the main pumping burden. By 
reversal of the valving, the system can be alternately vented or pumped rapidly 
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and efficiently. This same operational procedure is followed in other 
diffusion-pumped systems, such as electron microscopes, where ease of speci¬ 
men exchange is a requirement. To eliminate human error, pump-down cycles 
are now automated or computerized through the use of pressure sensors and 
electrically actuated valves. In other oil-less vacuum systems a similar valving 
arrangement exists between the involved fore- and main pumps. 

Components worthy of note in the aforementioned evaporator are the 
high-vacuum valve and the optically dense baffle, both of which are designed 
to have a high conductance. Cryogenic cooling of the baffle helps prevent oil 
from backstreaming or creeping into the vacuum chamber. To ensure proper 
pressure levels for the functioning of the diffusion pump, thermocouple gauges 
are located in both the roughing and backing forelines. They operate from 
10 -3 torr to 1 atm. Ionization gauges, on the other hand, are sensitive to 
vacuum levels spanning the range 10“ 3 to lower than 10“ 10 torr and are, 
therefore, located to measure chamber pressure. Virtually all quoted vacuum 
pressures in thin-film deposition, processing, and characterization activities are 
derived from ionization gauges. 

An actual vacuum deposition system is shown in Fig. 2-13. 



Figure 2-13. Vacuum deposition system. (Courtesy of Cooke Vacuum Products) 
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2.3.3. System Pumping Considerations 

During the pump-down of a system, gas is removed from the chamber by (1) 
volume pumping and (2) pumping of species outgassed from internal surfaces. 
For volume pumping it is a relatively simple matter to calculate the time 
required to reach a given pressure. As an example, let us estimate the time 
required to evacuate a cylindrical bell jar, 46 cm in diameter and 76 cm high, 
from atmospheric pressure to a forepressure of 10“ 1 torr. If an 8-L/sec 
mechanical pump is used, then substitution of S p = 8 L/sec, V = 

(x/4)(46) 2 (76)/1000 = 126.3 L, P(t) = 10' 1 torr, P { = 760 torr, and P 0 
« 10' 4 torr in Eq. 2-20 yields a pump-down time of 2.35 min. This value is 
comparable to typical forepumping times in clean, tight systems. 

It is considerably more difficult to calculate pumping times in the high- 
vacuum regime where the system pressure depends on outgassing rates. There 
are two sources of this gas: (1) permeation and diffusion through the system 
walls and (2) desorption from the chamber surfaces and vacuum hardware. 
Specific vacuum materials, surface condition (smooth, porous, degree of 
cleanliness, etc.), and bakeout procedures critically affect the gas evolution 
rate. If the latter is known, however, it is possible to determine the necessary 
pumping speed at the required operating pressure through the use of Eq. 2-16. 
For example, suppose the vacuum system described has a total surface area of 
1.5 m 2 , including all accessories. If the gas evolution rate (throughput) q 0 is 
assumed to be 1.5 x 10 4 (torr-L/sec)/m 2 , then maintenance of a pressure 
P = 7.5 x 10“ 7 torr requires an effective pumping speed of S = 1.5q 0 /P = 
300 L/sec. This value of 5 is necessary only to pump the quantity of gas 
arising through gas evolution from the walls, and is clearly a lower bound for 
the effective pumping speed of the system. 

Lastly, it is appropriate to comment on vacuum system leaks. There is 
scarcely a thin-film technologist who has not struggled with them. No vacuum 
apparatus is absolutely vacuum-tight and, in principle, does not have to be. 
What is important, however, is that the leak rate be small and not influence the 
required working pressure, gas content, or ultimate system pressure. Leak 
rates are given in throughput units, e.g. torr-L/sec, and measured by noting 
the pressure rise in a system after isolating the pumps. The leak tightness of 
high-vacuum systems can be generally characterized by the following leakage 
rates (Ref. 7): 

Very leak tight—< 10“ 6 torr-L/sec 

Adequately leak tight-10“ 5 torr-L/sec 

Not leak tight— > 10' 4 torr-L/sec 
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One way to distinguish between gas leakage and outgassing from the vessel 
walls and hardware is to note the pressure rise with time. Gas leakage causes a 
linear rise in pressure, whereas outgassing results in a pressure rise that 
becomes gradually smaller and tends to a limiting value. The effect of leakage 
throughput on pumping time can be accounted for by inclusion in Eq. 2-20. 



Exercises 



1. Consider a mole of gas in a chamber that is not being pumped. What is 
the probability of a self-pumping action such that all of the gas molecules 
will congregate in one-half of the chamber and leave a perfect vacuum in 
the other half? 

2 . A 1-m 3 cubical-shaped vacuum chamber contains 0 2 molecules at a 
pressure of 10 -4 atm at 300 K. 


a. How many molecules are there in the chamber? 

b. What is the ratio of maximum potential energy to average kinetic 
energy of these molecules? 

c. What fraction of gas molecules has a kinetic energy in the x direction 
exceeding RT1 What fraction exceeds 2RT2 

3 - In many vacuum systems there is a gate valve consisting of a gasketed 
metal plate that acts to isolate the chamber above from the pumps below. 


a. A sample is introduced into the chamber at 760 torr while the isolated 
pumps are maintained at 10~ 6 torr. For a 15-cm-diameter opening, 
what force acts on the valve plate to seal it? 

b. The chamber is forepumped to a pressure of 10 -2 torr. What force 
now acts on the valve plate? 


4 . Supersonic molecular beams have a velocity distribution given by 


f(v) = >li/ 3 exp - 


M(v - v 0 ) 2 

2RT 


where u 0 , the stream velocity, is related to the Mach number. 


a. What does a plot of f(v) vs. v look like? 

b. What is the average gas speed in terms of u 0 , Af, and T1 Note: 


f 


r(« + 1/2) 




1/2 


f 


dx = 




2 a 


« + i 


Assume v 0 - 0. 
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5 . The trap in Fig. 2-5 is filled with liquid N 2 so that the entire trap surface 
is maintained at 80 K. What effect does this have on conductance? 

6 - Two identical lengths of piping are to be joined by a curved 90° elbow 
section or a sharp right-angle elbow section. Which overall assembly is 
expected to have a higher conductance? Why? 

7 . Show that the conductance of a pipe joining two large volumes through 
apertures of area A and A 0 is given by C = \\JAA 0 /{A - A Q ). 
[Hint: Calculate the conductance of the assembly in both directions.] 

8 - A chamber is evacuated by two sorption pumps of identical pumping 
speed. In one configuration the pumps are attached in parallel so that both 
pump simultaneously. In the second configuration they pump in serial or 
sequential order (one on and one off). Comment on the system pumping 
characteristics (pressure vs. time) for both configurations. 

9 - It is common to anneal thin films under vacuum in a closed-end quartz 
tube surrounded by a furnace. Consider pumping on such a cylindrical 
tube of length £, diameter £), and conductance C that outgasses uni¬ 
formly at a rate q 0 (torr-L/cm 2 -sec). Derive an expression for the 
steady-state pressure distribution along the tube axis. [Hint: Equate the 
gas load within length dx to throughput through the same length.] 

10. After evacuation of a chamber whose volume is 30 L to a pressure of 
1 X 10” 6 torr, the pumps are isolated. The pressure rises to 1 X 10“ 5 
torr in 3 min. 

a. What is the leakage rate? 

b. If a diffusion pump with an effective speed of 40 L/sec is attached to 
the chamber, what ultimate pressure can be expected? 

11 . Select any instrument or piece of equipment requiring high vacuum 
during operation (e.g., electron microscope, evaporator, Auger spectrom¬ 
eter, etc.). Sketch the layout of the vacuum-pumping components within 
the system. Explain how the gauges that measure the system pressure 
work. 

12. A system of volume equal to 1 m 3 is evacuated to an ultimate pressure of 
10“ 7 torr employing a 200 L/sec pump. For a reactive evaporation 
process, 100 cm 3 of gas (STP) must be continuously delivered through 
the system per minute. 
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a. What is the ultimate system pressure under these conditions? 

b. What conditions are necessary to maintain this process at 10" 2 torr? 

13. In a tubular low-pressure chemical vapor deposition (LPCVD) reactor, 
gas is introduced at one end at a rate of 75 torr-L/min. At the other end is 
a vacuum pump of speed S p . If the reactor must operate at 1 torr, what 
value of S p is required? 
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Physical Vapor Deposition 


3 . 1 . Introduction 


In this chapter we focus on evaporation and sputtering, two of the most 
important methods for depositing thin films. The objective of these deposition 
processes is to controllably transfer atoms from a source to a substrate where 
film formation and growth proceed atomistically. In evaporation, atoms are 
removed from the source by thermal means, whereas in sputtering they are 
dislodged from solid target (source) surfaces through impact of gaseous ions. 
The earliest experimentation in both of these deposition techniques can appar¬ 
ently be traced to the same decade of the nineteenth century. In 1852, Grove 
(Ref. 1) observed metal deposits sputtered from the cathode of a glow 
discharge. Five years later Faraday (Ref. 2), experimenting with exploding 
fuselike metal wires in an inert atmosphere, produced evaporated thin films. 

Advances in the development of vacuum-pumping equipment and the fabri¬ 
cation of suitable Joule heating sources, first made from platinum and then 
tungsten wire, spurred the progress of evaporation technology. Scientific 
interest in the phenomenon of evaporation and the properties of thin metal 
films was soon followed by industrial production of optical components such as 
mirrors, beam splitters, and, later, antireflection coatings. Simultaneously, 
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sputtering was used as early as 1877 to coat mirrors. Later applications 
included the coating of flimsy fabrics with Au and the deposition of metal films 
on wax masters of phonograph records prior to thickening. Up until the late 
1960s, evaporation clearly surpassed sputtering as the preferred film deposition 
technique. Higher deposition rates, better vacuum, and, thus, cleaner environ¬ 
ments for film formation and growth, and general applicability to all classes of 
materials were some of the reasons for the ascendancy of evaporation methods. 
However, films used for magnetic and microelectronic applications necessi¬ 
tated the use of alloys, with stringent stoichiometry limits, which had to 
conformally cover and adhere well to substrate surfaces. These demands plus 
the introduction of radio frequency (RF), bias, and magnetron variants, which 
extended the capabilities of sputtering, and the availability of high-purity 
targets and working gases, helped to promote the popularity of sputter deposi¬ 
tion. Today the decision of whether to evaporate or sputter films in particular 
applications is not always obvious and has fostered a lively competition 
between these methods. In other cases, features of both have been forged into 
hybrid processes. 

Physical vapor deposition (PVD), the term that includes both evaporation 
and sputtering, and chemical vapor deposition (CVD), together with all of their 
variant and hybrid processes, are the basic film deposition methods treated in 
this book. Some factors that distinguish PVD from CVD are: 

1. Reliance on solid or molten sources 

2. Physical mechanisms (evaporation or collisional impact) by which source 
atoms enter the gas phase 

3. Reduced pressure environment through which the gaseous species are 
transported 

4. General absence of chemical reactions in the gas phase and at the substrate 
surface (reactive PVD processes are exceptions) 

The remainder of the chapter is divided into the following sections: 

3.2. The Physics and Chemistry of Evaporation 

3.3. Film Thickness Uniformity and Purity 

3.4. Evaporation Hardware and Techniques 

3.5. Glow Discharges and Plasmas 

3.6. Sputtering 

3.7. Sputtering Processes 

3.8. Hybrid and Modified PVD Processes 

Additional excellent reading material on the subject can be found in Refs. 
3-6. The book by Chapman is particularly recommended for its entertaining 
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and very readable presentation of the many aspects relating to phenomena in 
rarefied gases, glow discharges, and sputtering. 


3 . 2 . The Physics and Chemistry of Evaporation 


3.2.1. Evaporation Rate 

Early attempts to quantitatively interpret evaporation phenomena are connected 
with the names of Hertz, Knudsen, and, later, Langmuir (Ref. 3). Based on 
experimentation on the evaporation of mercury. Hertz, in 1882, observed that 
evaporation rates were: 


1. Not limited by insufficient heat supplied to the surface of the molten 
evaporant 

2. Proportional to the difference between the equilibrium pressure P e of Hg at 
the given temperature and the hydrostatic pressure P h acting on the 
evaporant. 


Hertz concluded that a liquid has a specific ability to evaporate at a given 
temperature. Furthermore, the maximum evaporation rate is attained when the 
number of vapor molecules emitted corresponds to that required to exert the 
equilibrium vapor pressure while none return. These ideas led to the basic 
equation for the rate of evaporation from both liquid and solid surfaces, 
namely, 


«MP € -p h ) 

V2ttMRT 


(3-1) 


where $ e is the evaporation flux in number of atoms (or molecules) per unit 
area per unit time, and a e is the coefficient of evaporation, which has a value 
between 0 and 1. When a e = 1 and P h is zero, the maximum evaporation rate 
is realized. By analogy with Eq. 2-9, an expression for the maximum value of 
$ e is 

p 

$ e = 3.513 X 10 22 -r== - molecules/cm 2 -sec. (3-2) 

wMT 

when P e is expressed in torr. A useful variant of this formula is 


r e = 5.834 x 10' 2 \[MjTP e g/cm 2 -sec, 


(3-3) 
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where Y e is the mass evaporation rate. At a pressure of 10” 2 torr, a typical 
value of T e for many elements is approximately 10” 4 g/cm 2 -sec of evaporant. 
The key variable influencing evaporation rates is the temperature, which has a 
profound effect on the equilibrium vapor pressure. 


3.2.2. Vapor Pressure of the Elements 

A convenient starting point for expressing the connection between temperature 
and vapor pressure is the Clausius-Clapeyron equation, which for both 
solid-vapor and liquid-vapor equilibria can be written as 

, 3 - 4 ) 

dT TAV v } 


The changes in enthalphy, AH(T ), and volume, AV, refer to differences 
between the vapor (u) and the particular condensed phase (c) from which it 
originates, and T is the transformation temperature in question. Since AV = 
V v - V c , and the volume of vapor normally considerably exceeds that of the 
condensed solid or liquid phase, AV = V v . If the gas is assumed to be perfect, 
V v * RT/P , and Eq. 3-4 may be rewritten as 


dP PAH(T) 
~dT ~ RT~ 2 


(3-5) 


As a first approximation, AH(T) = AHe, the molar heat of evaporation (a 
constant), in which case simple integration yields 

AHe , X 

In P rn -+ /, 3-6) 

RT V } 

where / is a constant of integration. Through substitution of the latent heat of 
vaporization for AHe, the boiling point for 7, and 1 atm for P, / can be 
evaluated for the liquid-vapor transformation. For practical purposes, Eq. 3-6 
adequately describes the temperature dependence of the vapor pressure in 
many materials. It is rigorously applicable only over a small temperature 
range, however. To extend the range of validity, we must account for 
the temperature dependence of A H{T). For example, careful evaluation of 
thermodynamic data reveals that the vapor pressure of liquid A1 is given by 
(Ref. 3) 

log P (vm) = 15,993/7+ 12.409 - 0.999 log T - 3.52 x 10~ 6 7\ (3-7) 
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TEMPERATURE (°K) 



Figure 3-1. Vapor pressures of selected elements. Dots correspond to melting points. 
(From Ref. 7). 


The Arrhenius character of log P vs. 1/7’ is essentially preserved, since the 
last two terms on the right-hand side are small corrections. 

Vapor-pressure data for many other metals have been similarly obtained and 
conveniently represented as a function of temperature in Fig. 3-1. Similarly, 
vapor-pressure data for elements important in the deposition of semiconductor 
films are presented in Fig. 3-2. Much of the data represent direct measure¬ 
ments of the vapor pressures. Other values are inferred indirectly from 
thermodynamic relationships and identities using a limited amount of experi¬ 
mental data. Thus the vapor pressures of refractory metals such as W and Mo 
can be unerringly extrapolated to lower temperatures, even though it may be 
impossible to measure them directly. 

Two modes of evaporation can be distinguished in practice, depending on 
whether the vapor effectively emanates from a liquid or solid source. As a rule 
of thumb, a melt will be required if the element in question does not achieve a 
vapor pressure greater than 10“ 3 torr at its melting point. Most metals fall into 
this category, and effective film deposition is attained only when the source is 
heated into the liquid phase. On the other hand, elements such as Cr, Ti, Mo, 
Fe, and Si reach sufficiently high vapor pressures below the melting point and, 
therefore, sublime. For example, Cr can be effectively deposited at high rates 
from a solid metal source because it attains vapor pressures of 10 2 torr some 
500 °C below the melting point. The operation of the Ti sublimation pump 
mentioned in Chapter 2 is, in fact, based on the sublimation from heated Ti 
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Figure 3-2. Vapor pressures of elements employed in semiconductor materials. Dots 
correspond to melting points. (Adapted from Ref. 8). 


filaments. A third example is carbon, which is used to prepare replicas of the 
surface topography of materials for subsequent examination in the electron 
microscope. The carbon is sublimed from an arc struck between graphite 
electrodes. 


3.2.3. Evaporation of Compounds 

While metals essentially evaporate as atoms and occasionally as clusters of 
atoms, the same is not true of compounds. Very few inorganic compounds 
evaporate without molecular change, and, therefore, the vapor composition is 
usually different from that of the original solid or liquid source. A consequence 
of this is that the stoichiometry of the film deposit will generally differ from 
that of the source. Mass spectroscopic studies of the vapor phase have shown 
that the processes of molecular association as well as dissociation frequently 
occur. A broad range of evaporation phenomena in compounds occurs, and 
these are categorized briefly in Table 3-1. 
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Table 3-1. Evaporation of Compounds 


Reaction 

Type 

Chemical Reaction 

Examples 

Comments 

Evaporation 

MX(s or /) -> MX(g) 

SiO, B 2 0 3 

Compound 

without 


GeO, SnO, AIN 

stoichiometry 

dissociation 


CaF 2> MgF 2 

maintained in 
deposit 

Decomposition 

MX(s) -* M(s) + (l/2)X 2 (g) 

Ag 2 S, Ag 2 Se 

Separate 

sources are 

Evaporation 
with dissociation 

MX(i) -* M(/) + (1 /n)X n (g) 

m-v 

semiconductors 

required to 
deposit these 
compounds 
Deposits are 
metal-rich; 

a. Chalcogenides 

MX(s) - M(g) + (l/2)X 2 (g) 

CdS, CdSe 

separate 

X = S, Se, Te 


CdTe 

sources are 

required to 
deposit these 
compounds 

b. Oxides 

M0 2 (s) -» MO(g) + (l/2)0 2 (g) 

Si0 2 , Ge0 2 
Ti0 2 , SnO 2 

Zr0 2 

Metal-rich 

discolored 
deposits; 
dioxides are 
best deposited 
in0 2 partial 
pressure 

(reactive 

evaporation) 


Note M = metal, X = nonmetal. 
Adapted from Ref. 3. 


3.2.4. Evaporation of Alloys 

Evaporated metal alloy films are widely utilized for a variety of electronic, 
magnetic, and optical applications as well as for decorative coating purposes. 
Important examples of such alloys that have been directly evaporated include 
Al-Cu, Permalloy (Fe-Ni), nichrome (Ni-Cr), and Co-Cr. Atoms in metals 
of such alloys are generally less tightly bound than atoms in the inorganic 
compounds discussed previously. The constituents of the alloys, therefore, 
evaporate nearly independently of each other and enter the vapor phase as 
single atoms in a manner paralleling the behavior of pure metals. Metallic 
melts are solutions and as such are governed by well-known thermodynamic 
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laws. When the interaction energy between A and B atoms of a binary AB 
alloy melt are the same as between A-A and B-B atom pairs, then no 
preference is shown for atomic partners. Such is the environment in an ideal 
solution. Raoult’s law, which holds under these conditions, states that the 
vapor pressure of component B in solution is reduced relative to the vapor 
pressure of pure B (/> B (0)) in proportion to its mole fraction X B . Therefore, 

*b = *b*b(0). (3-8) 

Metallic solutions usually are not ideal, however. This means that either 
more or less B will evaporate relative to the ideal solution case, depending on 
whether the deviation from ideality is positive or negative, respectively. A 
positive deviation occurs because B atoms are physically bound less tightly to 
the solution, facilitating their tendency to escape or evaporate. In real solutions 

*b = *b*b(0), (3-9) 

where ct B is the effective thermodynamic concentration of B known as the 
activity. The activity is, in turn, related to X B through an activity coefficient 
7 B ; i* e *> 

~ Tb^b- (3-10) 

By combination of Eqs. 3-2, 3-9, and 3-10, the ratio of the fluxes of A and 
B atoms in the vapor stream above the melt is given by 

»A 7a*a*a(0) [Mb , n 

* B 7b*b*b(0) V M a ‘ } 


Practical application of this equation is difficult because the melt composition 
changes as evaporation proceeds. Therefore, the activity coefficients, which 
can sometimes be located in the metallurgical literature, but just as frequently 
not, also change with time. As an example of the use of Eq. 3-11, consider the 
problem of estimating the approximate Al-Cu melt composition required to 
evaporate films containing 2 wt% Cu from a single crucible heated to 1350 K. 
Substituting gives 


*A1 

98/A/ ai 

*a.(0) 

*cu 

' 2/ M Cu ' 

*cu(0) 

*AI 

98 2 X 10~ 4 

163.7 


2 10“ 3 

V 27.0 


and assuming 7 Cu = y M , 


This suggests that the original melt composition should be enriched to 13.6 
wt% Cu in order to compensate for the preferential vaporization of Al. It is, 
therefore, feasible to evaporate such alloys from one heated source. If the alloy 
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melt is of large volume, fractionation-induced melt composition changes are 
minimal. A practical way to cope with severe fractionation is to evaporate 
from dual sources maintained at different temperatures. 

3 . 3 . Film Thickness Uniformity and Purity 
3.3.1. Deposition Geometry 

In this section aspects of the deposition geometry, including the characteristics 
of evaporation sources, and the orientation and placement of substrates are 
discussed. The source-substrate geometry, in turn, influences the ultimate film 
uniformity, a concern of paramount importance, which will be treated subse¬ 
quently. Evaporation from a point source is the simplest of situations to model. 
Evaporant particles are imagined to emerge from an infinitesimally small 
region ( dA e ) of a sphere of surface area A e with a uniform mass evaporation 
rate as shown in Fig. 3-3a. The total evaporated mass M e is then given by the 
double integral 

M e = H Y e dA e dt. (3-12) 

*^0 J A e 

Of this amount, mass dM s falls on the substrate of area dA s . Since the 
projected area dA s on the surface of the sphere is dA c , with dA c = dA 5 cos 8 , 
the proportionality dM s : M e = dA c : 4xr 2 holds. Finally, 

dM s M e cos 0 

dA s 4 tt r 2 

is obtained. On a per unit time basis we speak of deposition rate R 
(atoms/cm 2 -sec), a related quantity referred to later in the book. The deposi- 


POINT SOURCE SURFACE SOURCE 

Figure 3-3. Evaporation from (a) point source, (b) surface source. 
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tion varies with the geometric orientation of the substrate and with the inverse 
square of the source-substrate distance. Substrates placed tangent to the 
surface of the receiving sphere would be coated uniformly, since cos 0=1. 

An evaporation source employed in the pioneering research by Knudsen 
made use of an isothermal enclosure with a very small opening through which 
the evaporant atoms or molecules effused. These effusion or Knudsen cells are 
frequently employed in molecular-beam epitaxy deposition systems, where 
precise control of evaporation variables is required. Kinetic theory predicts that 
the molecular flow of the vapor through the opening is directed according to a 
cosine distribution law, and this has been verified experimentally. The mass 
deposited per unit area is given by 

dM s M e cos <i> cos 0 

dA s irr 2 

and now depends on two angles (emission and incidence) that are defined in 
Fig. 3-3b. Evaporation from a small area or surface source is also modeled by 
Eq. 3-14. Boat filaments and wide crucibles containing a pool of molten 
material to be evaporated approximate surface sources in practice. 

From careful measurements of the angular distribution of film thickness, it 
has been found that, rather than a cos <j> dependence, a cos n <f> evaporation law 
is more realistic. As shown in Fig. 3-4, n is a number that determines the 
geometry of the lobe-shaped vapor cloud and the angular distribution of 
evaporant flux from a source. When n is large, the vapor flux is highly 
directed. Physically n is related to the evaporation crucible geometry and 
scales directly with the ratio of the melt depth (below top of crucible) to the 


(3-14) 



Figure 3-4. Calculated lobe-shaped vapor clouds with various cosine exponents. 
(From Ref. 9). 
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melt surface area. Deep narrow crucibles with large n have been employed to 
confine evaporated radioactive materials to a narrow angular spread in order to 
minimize chamber contamination. The corresponding deposition equation is 
(Ref. 9) 

dM s M e (/? + l)cos n <£cos0 
dA s 2 tt r 2 

As the source becomes increasingly directional, the surface area effectively 
exposed to evaporant shrinks (i.e., 2xr 2 , irr 2 , and lirr 2 /(n 4- 1) for point, 
cos <)>, and cos n <j) sources, respectively). 


(n > 0). (3-15) 


3.3.2. Film Thickness Uniformity 


While maintaining thin-film thickness uniformity is always desirable, but not 
necessarily required, it is absolutely essential for microelectronic and many 
optical coating applications. For example, thin-film, narrow-band optical inter¬ 
ference filters require a thickness uniformity of ±1%. This poses a problem, 
particularly if there are many components to be coated or the surfaces involved 
are large or curved. Utilizing formulas developed in the previous section, we 
can calculate the thickness distribution for a variety of important source-sub¬ 
strate geometries. Consider evaporation from the point and small surface 
source onto a parallel plane-receiving substrate surface as indicated in the 
insert of Fig. 3-5. The film thickness d is given by dMjp dA s , where p is 
the density of the deposit. For the point source 

M e cos 6 M e h M e h 

d = —r = =- f - 372 • ( 3 “ 16 ) 

4ir pr 2 4irpr 3 4t p(h 2 + l 2 )' 


The thickest deposit ( d 0 ) occurs at / = 0, in which case d 0 = M e /A'Kph 2 y 
and, thus, 


d 1 

{i+(//*) i ) s/i 


(3-17) 


Similarly, for the surface source 

M. cos 0 cos <t> M. h h M.h 2 

d --- 5 -^- £ 3-18) 

irpr 2 t rpr 2 r r + / 2 )‘ 

since cos0 = cos <t> = h/r. When normalized to the thickest dimensions, or 
d 0 = MJirph 2 , 


d _ 1 

rf o“ {l + (l/h) 2 } 2 


(3-19) 
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l/h 

Figure 3-5. Film thickness uniformity for point and surface sources. (Insert) Geome¬ 
try of evaporation onto parallel plane substrate. 

A comparison of Eqs. 3-17 and 3-19 is made in Fig. 3-5, where it is apparent 
that less thickness uniformity can be expected with the surface source. 

A couple of practical examples (Ref. 10) will demonstrate how these film 
thickness distributions are used in designing source-substrate geometries for 
coating applications. In the first example suppose it is desired to coat a 
150-cm-wide strip utilizing two evaporation sources oriented as shown in the 
insert of Fig. 3-6. If a thickness tolerance of ± 10% is required, what should 
the distance between sources be and how far should they be located from the 
substrate? A superposition of solutions for two individual surface sources (Eq. 
3-19) gives the thickness variation shown graphically in Fig. 3-6 as a function 
of the relative distance r from the center line for various values of the source 
spacing D. All pertinent variables are in terms of dimensionless ratios r/h v 
and D/h v . The desired tolerance requires that d/d 0 stay between 0.9 and 1.1, 
and this can be achieved with D/h v = 0.6 yielding a maximum value of 
r/h v — 0.87. Since r = 150/2 = 75 cm, h v = 75/0.87 = 86.2 cm. The required 
distance between sources is therefore 2D = 2 X 0.6 X 86.2 = 103.4 cm. There 
are other solutions, of course, but we are seeking the minimum value of h v . It 
is obvious that the uniformity tolerance can always be realized by extending 
the source-substrate distance, but this wastes evaporant. 

As a second example, consider a composite optical coating where a ± 1 % 
film thickness variation is required in each layer. The substrate is rotated to 
even out source distribution anomalies and minimize preferential film growth 
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that can adversely affect coating durability and optical properties. Since 
multiple films of different composition will be sequentially deposited, the 
necessary fixturing requires that the sources be offset from the axis of rotation 
by a distance R = 20 cm. How high above the source should a 25-cm-diame- 
ter substrate be rotated to maintain the desired film tolerance? The film 
thickness distribution in this case is a complex function of the three-dimen¬ 
sional geometry, which, fortunately, has been graphed in Fig. 3-7. Reference 
to this figure indicates that the curve h v /R = 1.33 in conjunction with 
r/R = 0.6 will generate a thickness deviation ranging from about -0.6 to 
+ 0.5%. On this basis, the required distance is h v = 1.33 x 20 = 26.6 cm. 

A clever way to achieve thickness uniformity, however, is to locate both the 
surface evaporant source and the substrates on the surface of a sphere as shown 
in Fig. 3-8. In this case, cos 0 = cos 0 = r/2r 0 , and Eq. 3-14 becomes 

dM s M e r r M 

- 1 = —T-= - h ■ 3-20) 

dA s irr 2 2r 0 2 r 0 4i 

The resultant deposit thickness is a constant clearly independent of angle. Use 
is made of this principle in the planetary substrate fixtures that hold silicon 
wafers to be coated with metal (metallized) by evaporation. To further promote 
uniform coverage, the planetary fixture is rotated during deposition. Physi- 



0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 091.0 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2 

RELATIVE DISTANCE FROM CENTER LINE — 

h v 

Figure 3-6. Film thickness uniformity across a strip employing two evaporation 
sources for various values of D/h v . (From Ref. 10). 




92 


Physical Vapor Deposition 



Figure 3-7. Calculated film thickness variation across the radius of a rotating disk. 
(From Ref. 10). 


WAFER 

SUBSTRATES 



Figure 3-8. Evaporation scheme to achieve uniform deposition. Source and sub¬ 
strates lie on sphere of radius r 0 . 
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cally, deposition uniformity is achieved because short source-substrate dis¬ 
tances are offset by unfavorably large vapor emission and deposition angles. 
Alternately, long source-substrate distances are compensated by correspond¬ 
ingly small emission and reception angles. For sources with a higher degree of 
directionality (i.e., where cos^ rather than cos <t> is involved), the reader can 
easily show that thickness uniformity is no longer maintained. 

Two principal methods for optimizing film uniformity over large areas 
involve varying the geometric location of the source and interposing static as 
well as rotating shutters between evaporation sources and substrates. Computer 
calculations have proven useful in locating sources and designing shutter 
contours to meet the stringent demands of optical coatings. Film thickness 
uniformity cannot, however, be maintained beyond ± 1 % because of insuffi¬ 
cient mechanical stability of both the stationary and rotating hardware. 

In addition to the parallel source-substrate configuration, calculations of 
thickness distributions have also been made for spherical as well as conical, 
parabolic, and hyperbolic substrate surfaces (Ref. 9). Similarly, cylindrical, 
wire, and ring evaporation source geometries have been treated (Ref. 11). For 
the results, interested readers should consult the appropriate references. 


3.3.3. Conformal Coverage 

An issue related to film uniformity is step or, more generally, conformal 
coverage, and it arises primarily in the fabrication of integrated circuits. The 
required semiconductor contact and device interconnection metallization de¬ 
positions frequently occur over a terrain of intricate topography where mi¬ 
crosteps, grooves, and raised stripes abound. When the horizontal as well as 



Figure 3-9. Schematic illustration of film coverage of stepped substrate: (A) uniform 
coverage; (B) poor sidewall coverage; (C) lack of coverage—discontinuous film. 
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vertical surfaces of substrates are coated to the same thickness, we speak of 
conformal coverage. On the other hand, coverage will not be uniform when 
physical shadowing effects cause unequal deposition on the top and sidewalls 
of steps. Inadequate step coverage can lead to minute cracks in the metaliza- 
tion, which have been shown to be a major source of failure in device 
reliability testing. Thinned regions on conducting stripes exhibit greater Joule 
heating, which sometimes fosters early burnout. Step coverage problems have 
been shown to be related to the profile of the substrate step as well as to the 
evaporation source-substrate geometry. The simplest model of evaporation 
from a point source onto a stepped substrate results in either conformal 
coverage or a lack of deposition in the step shadow, as shown schematically in 
Fig. 3-9. Line-of-sight motion of evaporant atoms and sticking coefficients of 
unity can be assumed in estimating the extent of coverage. 

More realistic computer modeling of step coverage has been performed for 
the case in which the substrate is located on a rotating planetary holder (Ref. 

o 

12). In Fig. 3-10 coverage of a 1-jim-wide, 1-^m-high test pattern with 5000 A 




Figure 3-10. Comparison of simulated and experimental A1 film coverage of I-jtm 
line step and trench features. (Left) Orientation of most symmetric deposition. (Right) 
Orientation of most asymmetric deposition. (Reprinted with permission from Cowan 
Publishing Co., from C. H. Ting and A. R. Neureuther, Solid Slate Technology 25, 
115, 1982). 
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of evaporated A1 is simulated and compared with experiment. In the symmetric 
orientation the region between the pattern stripes always manages to “see” the 
source, and this results in a small plateau of full film thickness. In the 
asymmetric orientation, however, the substrate stripes cast a shadow with 
respect to the source biasing the deposition in favor of unequal sidewall 
coverage. In generating the simulated film profiles, the surface migration of 
atoms was neglected, a valid assumption at low substrate temperatures. Heat¬ 
ing the substrate increases surface diffusion of depositing atoms, thus promot¬ 
ing the filling of potential voids as they form. Interestingly, similar step 
coverage problems exist in chemical-vapor-deposited Si0 2 and silicon nitride 
films. 

3.3.4. Film Purity 

The chemical purity of evaporated films depends on the nature and level of 
impurities that (1) are initially present in the source, (2) contaminate the source 
from the heater, crucible, or support materials, and (3) originate from the 
residual gases present in the vacuum system. In this section only the effect of 
residual gases on film purity will be addressed. During deposition the atoms 
and molecules of both the evaporant and residual gases impinge on the 
substrate in parallel, independent events. The evaporant vapor impingement 
rate is pN A d/M a atoms/cm 2 -sec, where p is the density and d is the 
deposition rate (cm/sec). Simultaneously, gas molecules impinge at a rate 
given by Eq. 2-9. The ratio of the latter to the former is the impurity 
concentration C ; : 

p M 

C, = 5.82 X 10~ 2 - r = — (3-21) 

' jMJ pd ’ 

Terms M a and M g refer to evaporant and gas molecular weights, respec¬ 
tively, and P is the residual gas vapor pressure in torr. 

Table 3-2 illustrates the combined role that deposition rate and residual 
pressure play in determining the oxygen level that can be incorporated into thin 
tin films (Ref. 13). Although the concentrations are probably overestimated 
because the sticking probabiltiy of 0 2 is about 0.1 or less, the results have 
several important implications. To produce very pure films, it is important to 
deposit at very high rates while maintaining very low background pressures of 
residual gases such as H 2 0, C0 2 , CO, 0 2 , and N 2 . Neither of these 
requirements is too formidable for vacuum evaporation, where deposition rates 
from electron-beam sources can reach 1000 A/sec at chamber pressures of 
~ 10“ 8 torr. 

On the other hand, in sputtering processes, discussed later in the chapter. 
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Table 3-2. Maximum Oxygen Concentration in Tin Films Deposited 
at Room Temperature 


P °2 

(torr) 


Deposition Rate (A /sec) 



1 

10 

100 

1000 

10“ 9 

nr 3 

10- 4 

10 -5 

10' 6 

10“ 7 

10“' 

10' 2 

10“ 3 

10- 4 

10“ 5 

10 

1 

10“' 

10' 2 

10“ 3 

10 3 

10 2 

10 

1 


From Ref. 13. 


deposition rates are typically more than an order of magnitude less, and 
chamber pressures five orders of magnitude higher than for evaporation. 
Therefore, the potential exists for producing films containing high gas concen¬ 
trations. For this reason sputtering was traditionally not considered to be as 
“clean” a process as evaporation. Considerable progress has been made in the 
last two decades, however, with the commercial development of high-deposi- 
tion-rate magnetron sputtering systems, operating at somewhat lower gas 
pressures in cleaner vacuum systems. In the case of aluminum films, compara¬ 
ble purities appear to be attained in both processes. Lastly, Table 3-2 suggests 
that very high oxygen incorporation occurs at residual gas pressures of 10' 3 
torr. Advantage of this fact is taken in reactive evaporation processes where 
intentionally introduced gases serve to promote reactions with the evaporant 
metal and control the deposit stoichiometry. 

The presence of gaseous impurities within metal films sometimes has a 
pronounced effect in degrading many of its properties. Oxygen and nitrogen 
incorporation has been observed to reduce both the electrical conductivity and 
optical reflectivity as well as increase the hardness of A1 films (Ref. 14). 


3 . 4 . Evaporation Hardware and Techniques 


3.4.1. Resistance-Heated Evaporation Sources 

This section will primarily be devoted to a brief description of the most widely 
used methods for heating evaporants. Clearly, heaters must reach the tempera¬ 
ture of the evaporant in question while having a negligible vapor pressure in 
comparison. Ideally, they should not contaminate, react, or alloy with the 
evaporant or release gases such as oxygen, nitrogen, or hydrogen at the 
evaporation temperature. These requirements have led to the development and 
use of suitable resistance and electron-beam-heated sources. 
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Figure 3-11. Assorted resistance heated evaporation sources. (Courtesy of R. D. 
Mathis Company). 

Resistively heated evaporation sources are available in a wide variety of 
forms utilizing refractory metals singly or in combination with inert oxide or 
ceramic compound crucibles. Some of these are shown in Fig. 3-11. They can 
be divided into the following important categories. 

3 .4.1.1. Tungsten Wire Sources . Tungsten wire sources are in the form 
of individual or multiply stranded wires twisted into helical or conical shapes. 
Helical coils are used for metals that wet tungsten readily, whereas the conical 
baskets are better adapted to contain poorly wetting materials. In the former 
case, metal evaporant wire is wrapped around or hung from the tungsten 
strands, and the molten beads of metal are retained by surface tension forces. 

3.4.1. 2 . Refractory Metal Sheet Sources . Tungsten, tantalum, and 
molybdenum sheet metal sources, like the wire filaments, are self-resistance 
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heaters that require low-voltage, high-current power supplies. These sources 
have been fabricated into a variety of shapes, including dimpled strip, boat, 
canoe, and deep-folded configurations. Folded boat sources have been used to 
evaporate MgF 2 by containing the bulk salt and melting it prior to vaporiza¬ 
tion. Powder mixtures of metals and metal oxides used for coating ophthalmic 
lenses have been similarly evaporated from deep-folded boats in batch-type 
evaporators. 

3.4.7.3. Sublimation Furnaces . Efficient evaporation of sulfides, se- 
lenides, and some oxides is carried out in sublimation furnaces. The evaporant 
materials in powder form are pressed and sintered into pellets and heated by 
surrounding radiant heating sources. Spitting and ejection of particles caused 
by evolution of gases occluded within the source compacts are avoided through 
the use of baffled heating assemblies. These avoid direct line-of-sight access to 
substrates, and evaporation rates from such sources tend to be constant over 
extended periods of time. The furnaces are typically constructed of sheet 
tantalum, which is readily cut, bent, and spot-welded to form heaters, radiation 
shields, supports, and current bus strips. 

3.4. 1. 4. Crucible Sources . The most common sources are cylindrical cups 
composed of oxides, pyrolytic BN, graphite, and refractory metals, which are 
fabricated by hot-pressing powders or machining bar stock. These crucibles are 
normally heated by external tungsten wire heating elements wound to fit snugly 
around them. 

Other crucible sources rely on high-frequency induction rather than resis¬ 
tance heating. In a configuration resembling a transformer, high-frequency 
currents are induced in either a conducting crucible or evaporant charge 
serving as the secondary, resulting in heating. The powered primary is a coil 
of water-cooled copper tubing that surrounds the crucible. Aluminum evapo¬ 
rated from BN or BN/TiB 2 composite crucibles, in order to metallize inte¬ 
grated circuits, is an important example of the use of induction heating. 

Another category of crucible source consists of a tungsten wire resistance 
heater in the form of a conical basket encased in A1 2 0 3 or refractory oxide to 
form an integral crucible-heater assembly. Such crucibles frequently serve as 
evaporant sources in laboratory scale film deposition systems. 

3.4.2. Electron-Beam Evaporation 

Disadvantages of resistively heated evaporation sources include possible con¬ 
tamination by crucibles, heaters, and support materials and the limitation of 
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relatively low input power levels. This makes it difficult to deposit pure films 
or evaporate high-melting-point materials at appreciable rates. Electron-beam 
heating eliminates these disadvantages and has, therefore, become the most 
widely used vacuum evaporation technique for preparing highly pure films. In 
principle, this type of source enables evaporation of virtually all materials at 
almost any rate. As shown in Fig. 3-12, the evaporant charge is placed in 
either a water-cooled crucible or in the depression of a water-cooled copper 
hearth. The purity of the evaporant is ensured because only a small amount of 
charge melts or sublimes so that the effective crucible is the unmelted skull 
material next to the cooled hearth. For this reason there is no contamination of 
the evaporant by Cu. Multiple-source units are available for the sequential or 
parallel deposition of more than one material. 

In the most common configuration of the gun source, electrons are thermion- 
ically emitted from heated filaments, which are shielded from direct line of 
sight of the evaporant charge and substrate. Film contamination from the 
heated electron source is eliminated in this way. The filament cathode assem¬ 
bly potential is biased negatively with respect to a nearby grounded anode by 
anywhere from 4 to 20 kV, and this serves to accelerate the electrons. In 
addition, a transverse magnetic field is applied, which serves to deflect the 
electron beam in a 270° circular arc and focus it on the hearth and evaporant 
charge at ground potential. The reader can verify the electron trajectory 
through the use of the right-hand rule. This states that if the thumb is in the 
direction of the initial electron emission, and the middle finger lies in the 
direction of the magnetic field (north to south), then the forefinger indicates 
the direction of the force on the electron and its resultant path at any instant. 

It is instructive to estimate the total power that must be delivered by the 
electron beam to the charge in order to compensate for the following heat 
losses incurred during evaporation of 10 18 atoms/cm 2 -sec (Ref. 5). 

1. The power density P s (watts/cm 2 ) that must be supplied to account for the 
heat of sublimation A H s (eV) is 

P s = 10 18 (l .6 x 10“ * 9 ) A// 5 = 0.16 A// 5 . (3-22a) 

2. The kinetic energy of evaporant is (3/2 )kT s per atom so that the required 
power density P k is 

P k = 10 18 (3/2)(l .38 x 10- 23 )7; = 2.07 x 10“ 5 7;, (3-22b) 

where T s is the source temperature. 

3. The radiation heat loss density is 

P r = 5.67 x 10' 12 ^(7; 4 - r 4 ), 
where £ is the source emissivity at T s , and T 0 ^ 293 K. 


(3-22c) 
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4. Heat conduction through a charge of thickness l into the hearth dissipates a 
power density P c equal to 

/>c = *(^T^)’ (3 ' 22d) 

where k is the thermal conductivity of the charge. For the case of Au at T s = 
1670 K, where A H s ~ 3.5 eV, e ~ 0.4, l = 1 cm, and « =3.1 W/cm-K, 
the corresponding values are P s = 0.56 W/cm 2 , P k = 0.034 W/cm 2 , P r = 
17.6 W/cm 2 , and P c = 4.3 kW/cm 2 . Clearly the overwhelming proportion of 
the power delivered by the electron beam is conducted through the charge to 
the hearth. In actuality, power densities of - 10 kW/cm 2 are utilized in 
melting metals, but such levels would damage dielectrics, which require 
perhaps only 1-2 kW/cm 2 . To optimize evaporation conditions, provision is 
made for altering the size of the focal spot and for electromagnetically 
scanning the beam. 

3.4.3. Deposition Techniques 

By now, films of virtually all important materials have been prepared by 
physical vapor deposition techniques. A practical summary (Refs. 15-17) of 
vacuum evaporation methods is given in Table 3-3, where recommended 
heating sources and crucible materials are listed for a number of metals, alloys, 
oxides, and compounds. Prior to settling on a particular vapor phase deposition 
process, both PVD and CVD options should be investigated together with the 
numerous hybrid variants of these methods (see Section 3.8). Paramount 
attention should be paid to film quality and properties, and the requirements 
and costs necessary to achieve them. If, after all, vacuum evaporation is 
selected, modestly equipped laboratories may wish to consider the resistively 
heated sources before the more costly electron-beam or induction heating 
alternatives. 


3 . 5 . Glow Discharges and Plasmas 


3.5.1. Introduction 

A perspective of much of the contents of the remainder of the chapter can be 
had by considering the simplified sputtering system shown in Fig. 3-13a. The 
target is a plate of the materials to be deposited or the material from which a 
film is synthesized. Because it is connected to the negative terminal of a dc or 
RF power supply, the target is also known as the cathode. Typically, several 
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Table 3-3. Evaporation Characteristics of Materials 


Materia! 

Minimum 1 * 
Evap. Temp 

State of 

Evaporation 

Recommended 

Crucible 

Material 

e-beam b 

Deposition Power 
Rate (A/s) (kW) 

Aluminum 

1010 

Melts 

BN 

20 

5 

Aluminum 






oxide 

1325 

Semimelts 


10 

0.5 

Antimony 

425 

Melts 

BN, Al 2 0 3 

50 

0.5 

Arsenic 

210 

Sublimes 

ai 2 o 3 

100 

0.1 

Beryllium 

1000 

Melts 

Graphite, BeO 

100 

1.5 

Beryllium 






oxide 


Melts 


40 

1.0 

Boron 

1800 

Melts 

Graphite, WC 

10 

1.5 

Boron 






carbide 


Semimelts 


35 

1.0 

Cadmium 

180 

Melts 

Al 2 0 3 , quartz 

30 

0.3 

Cadmium 






sulfide 

250 

Sublimes 

Graphite 

10 

0.25 

Calcium 






fluoride 


Semimelts 


30 

0.05 

Carbon 

2140 

Sublimes 


30 

1.0 

Chromium 

1157 

Sublimes 

W 

15 

0.3 

Cobalt 

1200 

Melts 

A1 2 0 3 , BeO 

20 

2.0 

Copper 

1017 

Melts 

Graphite, Al 2 0 3 

50 

0.2 

Gallium 

907 

Melts 

Al 2 0 3 , graphite 



Ge rmanium 

1167 

Melts 

Graphite 

25 

3.0 

Gold 

1132 

Melts 

ai 2 o 3 , bn 

30 

6.0 

Indium 

742 

Melts 

Aljo, 

100 

0.1 

Iron 

1180 

Melts 

A1 2 0 3 , BeO 

50 

2.5 

Lead 

497 

Melts 

ai 2 o 3 

30 

0.1 

Lithium 


melts 




fluoride 

1180 

(viscous) 

Mo, W 

10 

0.15 

Magnesium 

327 

sublimes 

graphite 

100 

0.04 

Magnesium 






fluoride 

1540 

semimelts 

ai 2 o 3 

30 

0.01 

Molybdenum 

2117 

melts 


40 

4.0 

Nickel 

1262 

melts 

ai 2 o 3 

25 

2.0 

Permalloy 

1300 

melts 

ai 2 o 3 

30 

2.0 

Platinum 

1747 

melts 

graphite 

20 

4.0 

Silicon 

1337 

melts 

BeO 

15 

0.15 

Silicon 






dioxide 

850 

semimelts 

Ta 

20 

0.7 


a Temperature (°C) at which vapor pressure is 10“ 4 torr. 
b For 10 kV, copper hearth, source-substrate distance of 40 cm. 
Adapted from Refs, 16 and 17. 
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Table 3-3. Continued . 


Material 

Minimum 11 
Evap. Temp 

State of 
Evaporation 

Recommended 

Crucible 

Material 

e-beam 6 

Deposition Power 
Rate (A/s) (kW) 

Silicon 






monoxide 

600 

sublimes 

Ta 

20 

0.1 

Tantalum 

2590 

semimelts 


100 

5.0 

Tin 

997 

melts 

A1 2 0 3 , graphite 

10 

2.0 

Titanium 

1453 

melts 


20 

1.5 

Titanium 






dioxide 

1300 

melts 

W 

10 

1.0 

Tungsten 

2757 

melts 


20 

5.5 

Zinc 

250 

sublimes 

ai 2 o 3 

50 

0.25 

Zinc 






selenide 

660 

sublimes 

quartz 



Zinc 






sulfide 

300 

sublimes 

Mo 



Zirconium 

1987 

melts 

W 

20 

5.0 


MATCHING 



DC RF 

Figure 3-13. Schematics of simplified sputtering systems: (a) dc, (b) RF. 


kilovolts are applied to it. The substrate that faces the cathode may be 
grounded, electrically floating, biased positively or negatively, heated, cooled, 
or some combination of these. After evacuation of the chamber, a gas, 
typically argon, is introduced and serves as the medium in which a discharge is 
initiated and sustained. Gas pressures usually range from a few to 100 mtorr. 
After a visible glow discharge is maintained between the electrodes, it is 
observed that a current flows and that a film condenses on the substrate 
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(anode). In vacuum, of course, there is no current flow and no film deposition. 
Microscopically, positive ions in the discharge strike the cathode plate and 
eject neutral target atoms through momentum transfer. These atoms enter and 
pass through the discharge region to eventually deposit on the growing film. In 
addition, other particles (secondary electrons, desorbed gases, and negative 
ions) as well as radiation (X-rays and photons) are emitted from the target. In 
the electric field the negatively charged ions are accelerated toward the 
substrate to bombard the growing film. 

From this simple description, it is quite apparent that compared to the 
predictable rarefied gas behavior in an evaporation system, the glow discharge 
is a very busy and not easily modeled environment. Regardless of the type of 
sputtering, however, roughly similar discharges, electrode configurations, and 
gas-solid interactions are involved. Therefore, issues common to all glow 
discharges will be discussed prior to the detailed treatment required of specific 
sputtering processes and applications. 

3.5.2. DC Glow Discharges 

The manner in which a glow discharge progresses in a low-pressure gas using 
a high-impedance dc power supply is as follows (Refs. 4-6). A very small 
current flows at first due to the small number of initial charge carriers in the 
system. As the voltage is increased, sufficient energy is imparted to the 
charged particles to create more carriers. This occurs through ion collisions 
with the cathode, which release secondary electrons, and by impact ionization 
of neutral gas atoms. With charge multiplication, the current increases rapidly, 
but the voltage, limited by the output impedance of the power supply, remains 
constant. This regime is known as the Townsend discharge. Large numbers of 
electrons and ions are created through avalanches. Eventually, when enough of 
the electrons generated produce sufficient ions to regenerate the same number 
of initial electrons, the discharge becomes self-sustaining. The gas begins to 
glow now, and the voltage drops, accompanied by a sharp rise in current. At 
this state “normal glow*’ occurs. Initially, ion bombardment of the cathode is 
not uniform but is concentrated near the cathode edges or at other surface 
irregularities. As more power is applied, the bombardment increasingly spreads 
over the entire surface until a nearly uniform current density is achieved. A 
further increase in power results in higher voltage and current density levels. 
The “abnormal discharge” regime has now been entered, and this is the 
operative domain for sputtering and other discharge processes (e.g., plasma 
etching of thin films). At still higher currents, low-voltage arcs propagate. 

Adjacent to the cathode there is a highly luminous layer known as the 
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CATHODE NEGATIVE FARADAY ANODE 7 ANODE 


GLOW | GLOW DARK SPACE DARK GLOW 
CATHODE SPACE 

(CROOKES) 

DARK SPACE 

Figure 3-14. Luminous regions of the dc glow discharge. 


cathode glow. The light emitted depends on the incident ions and the cathode 
material. In the cathode glow region, neutralization of the incoming discharge 
ions and positive cathode ions occurs. Secondary electrons start to accelerate 
away from the cathode in this area and collide with neutral gas atoms located 
some distance away from the cathode. In between is the Crookes dark space, a 
region where nearly all of the applied voltage is dropped. Within the dark 
space the positive gas ions are accelerated toward the cathode. 

The next distinctive region is the “negative glow,” where the accelerated 
electrons acquire enough energy to impact-ionize the neutral gas molecules. 
Beyond this is the Faraday dark space and finally the positive column. The 
sequence of these discharge regimes is schematically depicted in Fig. 3-14. 
The substrate (anode) is placed inside the negative glow, well before the 
Faraday dark space so that the latter as well as the positive column do not 
normally appear during sputtering. 


3.5.3. Discharge Species (Ref. 6) 

A discharge is essentially a plasma—i.e., a partially ionized gas composed of 
ions, electrons, and neutral species that is electrically neutral when averaged 
over all the particles contained within. Moreover, the density of charged 
particles must be large enough compared with the dimensions of the plasma so 
that significant Coulombic interaction occurs. This interaction enables the 
charged species to behave in a fluidlike fashion and determines many of the 
plasma properties. The plasmas used in sputtering are called glow discharges. 
In them the particle density is low enough, and the fields are sufficiently strong 
so that neutrals are not in equilibrium with electrons. Typically, the degree of 
ionization or ratio between numbers of ions and neutrals is about 10 -4 . 
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Therefore, at pressures of 10 mtorr, the perfect gas law indicates that about 
3 x 10 9 ions as well as electrons/cm 3 will be present at 25 °C. Measurements 
on glow discharges yield average electron energies of about 2 eV. The 
effective temperature T associated with a given energy E is simply given by 
T = E/k, where k is the Boltzmann constant. Substituting, we find that 
electrons have an astoundingly high temperature of some 23,000 K. However, 
because there are so few of them, their heat content is small and the chamber 
walls do not heat appreciably. Neutrals and ions are not nearly as energetic; 
the former have energies of only 0.025 eV (or T — 290 K) and the latter, 
energies of - 0.04 eV (or T = 460 K). Ions have higher energies than 
neutrals because they acquire energy from the applied electric field. 

Since surfaces (e.g., targets, substrates) are immersed in the plasma, they 
are bombarded by the species present. The neutral particle flux can be 
calculated from Eq. 2-8. Charged particle impingement results in an effective 
current density /, given by the product of the particle flux and the charge q t 
transported. Therefore, 

•/, = «,<7,5,/4. (3-23) 

where n i and v i are the specie concentration and mean velocity, respectively. 
By Eq. 2-3b, iJ, = (8 kT / wm) ] /2 . For electrons nt = 9.1 X 10“ 28 g, and if 
we assume T = 23,000 K and n = 10 lo /cm 3 , / electron — 38 mA/cm 2 . The 
ions, present in the same amounts as electrons, are much heavier and have a 
lower effective temperature than the electrons. This accounts for their very low 
velocity compared with that of electrons. For example, u ion = 5.2 X 10 4 
cm/sec for Ar ions as well as neutral atoms, whereas for electrons i> eleciron = 
9.5 x 10 7 cm/sec. The ion current is correspondingly reduced relative to the 
electron current by the ratio of these velocities, so / ion = 21 jiA/cm 2 . 

The implication of this simple calculation is that an isolated surface within 
the plasma charges negatively initially. Subsequently, additional electrons are 
repelled and positive ions are attracted. Therefore, the surface continues to 
charge negatively at a decreasing rate until the electron flux equals the ion flux 
and there is no net steady-state current. We can then expect that both the anode 
and cathode in the glow discharge will be at a negative potential with respect to 
the plasma. Of course, the application of the large external negative potential 
alters the situation, but the voltage distribution in a dc glow discharge under 
these conditions is shown schematically in Fig. 3-15. A sheath develops around 
each electrode with a net positive space charge. The lower electron density in 
the sheath means less ionization and excitation of neutrals. Hence, there is less 
luminosity there than in the glow itself. Electric fields (derivative of the 
potential) are restricted to the sheath regions. The plasma itself is not at a 
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Figure 3-15. Voltage distribution across dc glow discharge. 

potential intermediate between that of the electrodes but is typically some 10 V 
positive with respect to the anode at zero potential. Sheath width dimensions 
depend on the electron density and temperature. Using the values given earlier 
for the electrically isolated surface, we find that the sheath width is about 100 
fim. It is at the sheath-plasma interface that ions begin to accelerate on their 
way to the target during sputtering; electrons, however, are repelled from both 
sheath regions. All of these unusual charge effects stem from the fact that the 
fundamental plasma particles (electrons and ions) have such different masses 
and, hence, velocities and energies. 


3.5.4. Collision Processes 


Collisions between electrons and all the other species (charged or neutral) 
within the plasma dominate the properties of the glow discharge. Collisions are 
elastic or inelastic, depending on whether the internal energy of the colliding 
specie is preserved. In an elastic collision, exemplified by the billiard ball 
analogy of elementary physics, only kinetic energy is interchanged, and we 
speak of conservation of momentum and kinetic energy of translational 
motion. The potential energy basically resides within the electronic structure 
of the colliding ions, atoms, and molecules, etc., and increases in potential 
energy are manifested by ionization or other excitation processes. In an elastic 
collision, no atomic excitation occurs and potential energy is conserved. This 
is the reason why only kinetic energy is considered in the calculation. The 
well-known result for elastic binary collisions is 


Ei 

Ei 


4 M,M 2 
(M, +M 2 ) 2 


cos 2 0, 


(3-24) 


where 1 and 2 refer to the two particles of mass M i and energy E t . We assume 
M 2 is initially stationary and M { collides with it at an angle $ defined by the 
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initial trajectory and the line joining their centers at contact. The quantity 
4M, M 2 /(A/, + M 2 ) 2 is known as the energy transfer function. When = 
M 2j it has a value of 1; i.e., after collision the initial moving projectile 
remains stationary, and all of its energy is efficiently transferred to the second 
particle, which speeds away. When, however, Af, < Af 2 , reflecting, say, a 
collision between a moving electron and a stationary nitrogen molecule, then 
the energy transfer function is ~4M l /M 2 and has a typical value of 
— 10 ” 4 . Little kinetic energy is transferred in the collision of the light electron 
with the massive nitrogen atom. 

Now consider inelastic collisions. The change in internal energy, A U, of the 
struck particle must now be accounted for in the condition requiring conserva¬ 
tion of total energy. It is left as an exercise for the reader to demonstrate that 
the maximum fraction of kinetic energy transferred is given by 


A U 

(1/2 )M x v] 


M 2 

M l + M 2 


cos 2 6 , 


(3-25) 


where u, is the initial velocity of particle 1. For the inelastic collision between 
an electron and nitrogen molecule, AL//(l/2)A/,u 2 = 1, when cos0 = 1. 
Therefore, contrary to an elastic collision, virtually all of an electron’s kinetic 
energy can be transferred to the heavier species in the inelastic collision. 

We now turn our attention to a summary of the rich diversity of inelastic 
collisions and chemical processes that occur in plasmas. It is well beyond the 
scope of this book to consider anything beyond a cataloging of reactions (Ref. 
6). Suffice it to say that these reactions generally enhance film deposition and 
etching processes. 


1. Ionization . The most important process in sustaining the discharge is 
electron impact ionization. A typical reaction is 

e - + Ar° -► Ar + + 2e”. (3-26a) 

The two electrons can now ionize more Ar°, etc. By this multiplication 
mechanism the glow discharge is sustained. The reverse reaction, in which an 
electron combines with the positive ion to form a neutral, also occurs and is 
known as recombination. 


2. Excitation . In this case the energy of the electron excites quantitized 
transitions between vibrational, rotational, and electronic states, leaving the 
molecule in an excited state (denoted by an asterisk). An example is 

e" + 0 2 ° -> + e“. (3-26b) 

3. Dissociation . In dissociation the molecule is broken into smaller atomic 
or molecular fragments. The products (radicals) are generally much more 
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chemically active than the parent gas molecule and serve to accelerate reac¬ 
tions. Dissociation of CF 4 , for example, is relied on in plasma etching or film 
removal processes; i.e., 

e~+ CF 4 - e"+ CF 3 * + F*. (3-26c) 

4. Dissociative Ionization. During dissociation one of the excited species 
may become ionized; e.g., 

e“+ CF 4 -» 2e“+ CF 3 + + F*. (3-26d) 

5. Electron Attachment. Here neutral molecules become negative ions 
after capturing an electron. For example, 

e"+ SF 6 ° -SF 6 ~. (3-26e) 

6. Dissociative Attachment . 

e“+ N 2 ° N + + N"+ e“. (3-26f) 

In addition to electron collisions, ion-neutral as well as excited or 
metastable-excited, and excited atom-neutral collisions occur. Some generic 
examples of these reactions are as follows: 

7. Symmetrical Charge Transfer . 

A -I- A + -> A + + A. 

8. Asymmetric Charge Transfer. 

A + B + - A + -hB. 

9. Metastable-Neutral. 

A* + B-+B + A + e". 

10. Metastable-Metastable Ionization. 

A* + A* A + A++ e“. 

Evidence for these uncommon gas-phase species and reactions has accumu¬ 
lated through real-time monitoring of discharges by mass as well as light 
emission spectroscopy. As a result, a remarkable picture of plasma chemistry 
has emerged. For example, a noble gas like Ar when ionized loses an electron 
and resembles Cl electronically as well as chemically. The fact that these 
species are not in equilibrium confounds the thermodynamic and kinetic 
descriptions of these reactions. 


3.6. Sputtering 


3.6.1. Ion-Surface Interactions 

Critical to the analysis and design of sputtering processes is an understanding 
of what happens when ions collide with surfaces (Ref. 18). Some of the 
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interactions that occur are shown schematically in Fig. 3-16. Each depends on 
the type of ion (mass, charge), the nature of surface atoms involved, and, 
importantly, on the ion energy. Several of these interactions have been 
capitalized upon in widely used thin-film processing, deposition, and character¬ 
ization techniques. For example, ion implantation involves burial of ions under 
the target surface. Ion implantation of dopants such as B, P, As into Si wafers 
at ion energies ranging from tens to 100 keV is essential in the fabrication of 
devices in very large scale integrated (VLSI) circuits. Even higher energies are 
utilized to implant dopants into GaAs matrices. Ion fluxes, impingement times, 
and energies must be precisely controlled to yield desired doping levels and 
profiles. In contrast, ion-scattering spectroscopy techniques require that the 
incident ions be reemitted for measurement of energy loss. Rutherford 
backscattering (RBS) is the most important of these analytical methods and 
typically relies on 2-MeV He + ions. Through measurement of the intensity of 
the scattered ion signal, it is possible to infer the thickness and composition of 
films as well as subsurface compound layers. This subject is treated at length 
in Chapter 6. Secondary electrons as well as the products of core electron 
excitation—Auger electrons, X-rays, etc.—also form part of the complement 
of particles and radiation leaving the surface. 


3.6.2. Sputter Yield 

When the ion impact establishes a train of collision events in the target leading 
to the ejection of a matrix atom, we speak of sputtering. An impressive body of 
literature has been published indicating that sputtering is related to momentum 
transfer from energetic particles to the surface atoms of the target. Sputtering 
has, therefore, been aptly likened to “atomic pool” where the ion (cue ball) 
breaks up the close-packed rack of atoms (billiard balls), scattering some 
backward (toward the player). Even though atoms of a solid are bound to one 
another by a complex interatomic potential, whereas billiard balls do not 
interact, sputtering theory uses the idea of elastic binary collisions. Theoretical 
expressions for the sputter yield S, the most fundamental parameter character¬ 
izing sputtering, include the previously introduced energy transfer function. 
The sputter yield is defined as the number of atoms or molecules ejected from 
a target surface per incident ion and is a measure of the efficiency of 
sputtering. 

Intuitively we expect S to be proportional to a product of the following 
factors (Ref. 19): 

1. The number of atoms displaced toward the surface per primary collision. 

This term is given by E/2E n where E is the mean energy of the struck 
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target atom and E t is the threshold energy required to displace an atom. 
The factor of 2 is necessary because only half of the displaced atoms move 
toward the surface. The quantity E may be taken as an average of E 2 , the 
kinetic energy transferred to the target atom, and E t ; i.e., 

_ E 2 “h Ef 

E = -, where E 2 = 

2 


4 M l M 2 

(AT, + A * 2 ) 2 '' 


2. The number of atomic layers that contain these atoms and contribute to 
sputtering. Statistics show that the number of collisions required to slow an 
atom of energy E to E b , the surface binding energy, is 


N = 


In E/E b 
In 2 


(3-27) 


By a random walk model, the average number of contributing atomic layers 
is 1 + N ]/2 . 

3. The number of target atoms per unit area n A . 

4. The cross section a 0 = ira 2 , where a is related to the Bohr radius of the 
atom a b , and the atomic numbers Z x , Z 2 of the incident ion and sputtered 
atom respectively; i.e., 

/zf 3 + zf ' 3 ' 


Combining terms gives 


5 





o- 0 « A . 


(3-28) 


As an example consider the sputtering of Cu with 1-keV Ar ions. The 
calculated value of 5 will depend strongly on E n and for Ar incident on Cu 
experiment suggests that E t — 17 eV. For Cu, M 2 ~ 63.5, Z 2 = 29, a b = 
1.17 A, n A = 1.93 X 10 15 atoms/cm 2 , and E b = 3.5 eV. For Ar, A/, = 39.9 
and Z, = 18. Substitution shows that E — 483 eV and S = 2.6. This calcu¬ 
lated value compares with the measured sputter yield of 2.85, as indicated by 
the data of Table 3-4. 

The currently accepted theory for the sputtering yield from collision cas¬ 
cades is due to Sigmund (Ref. 20) and predicts that 

3a AM x M 2 E x 

S ~ 4ir 2 (M, + M 2 ) 2 


(£, < 1 keV) 


(3-29) 
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Table 3-4. Sputtering Yield Data for Metals (atoms/ion) 


Sputtering Gas 
Energy (keV) 

He 

0.5 

Nc 

0.5 

Ar 

0.5 

Kr 

0.5 

Xe 

0.5 

Ar 

1.0 

Ar 

Threshold 
Voltage (eV) 

Ag 

0.20 

1.77 

3.12 

3.27 

3.32 

3.8 

15 

A1 

0.16 

0.73 

1.05 

0.96 

0.82 

1.0 

13 

Au 

0.07 

1.08 

2.40 

3.06 

3.01 

3.6 

20 

Be 

0.24 

0.42 

0.51 

0.48 

0.35 


15 

C 

0.07 

— 

0.12 

0.13 

0.17 



Co 

0.13 

0.90 

1.22 

1.08 

1.08 


25 

Cu 

0.24 

1.80 

2.35 

2.35 

2.05 

2.85 

17 

Fe 

0.15 

0.88 

1.10 

1.07 

1.00 

1.3 

20 

Ge 

0.08 

0.68 

1.1 

1.12 

1.04 


25 

Mo 

0.03 

0.48 

0.80 

0.87 

0.87 

1.13 

24 

Ni 

0.16 

1.10 

1.45 

1.30 

1.22 

2.2 

21 

Pt 

0.03 

0.63 

1.40 

1.82 

1.93 


25 

Si 

0.13 

0.48 

0.50 

0.50 

0.42 

0.6 


Ta 

0.01 

0.28 

0.57 

0.87 

0.88 


26 

Ti 

0.07 

0.43 

0.51 

0.48 

0.43 


20 

W 

0.01 

0.28 

0.57 

0.91 

1.01 


33 


From Refs. 4 and 6. 


and 

„ Z,z 2 / M, ) s„(g) 

“ Z 3/3 + Z | /3 \ A/, + M 2 ] E b 


(is, > 1 keV). (3-30) 


These equations depend on two complex quantities, a and S n (E). The 
parameter a, a measure of the efficiency of momentum transfer in collisions, 
increases monotonically from 0.17 to 1.4 as M x /M 2 ranges from 0.1 to 10. 
The reduced stopping power, S n (E ), is a measure of the energy loss per unit 
length due to nuclear collisions. It is a function of the energy as well as masses 
and atomic numbers of the atoms involved. At high energy, S is relatively 
constant because S n (E ) tends to be independent of energy. 

The sputter yields for a number of metals are entered in Table 3-4. Values 
for two different energies (0.5 keV and 1.0 keV) as well as five different inert 
gases (He, Ne, Ar, Kr, and Xe) are listed. It is apparent that S values typically 
span a range from 0.01 to 4 and increase with the mass and energy of the 
sputtering gas. 


3.6.3. Sputtering of Alloys 

In contrast to the fractionation of alloy melts during evaporation, with subse¬ 
quent loss of deposit stoichiometry, sputtering allows for the deposition of 
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films having the same composition as the target source. This is the primary 
reason for the widespread use of sputtering to deposit metal alloy films. We 
note, however, that each alloy component evaporates with a different vapor 
pressure and sputters with a different yield. Why then is film stoichiometry 
maintained during sputtering and not during evaporation? One reason is the 
generally much greater disparity in vapor pressures compared with the differ¬ 
ence in sputter yields under comparable deposition conditions. Second, and 
perhaps more significant, melts homogenize readily due to rapid atomic 
diffusion and convection effects in the liquid phase; during sputtering, how¬ 
ever, minimal solid-state diffusion enables the maintenance of the required 
altered target surface composition. 

Consider now sputtering effects (Ref. 5) on a binary alloy target surface 
containing a number of A atoms (n A ) and B atoms (n B ), such that the total 
number is n = n A + n B . The target concentrations are C A = n A /n and 
C B = n B /n, with sputter yields S A and S B . Initially, the ratio of the sputtered 
atom fluxes (^) is given by 

^B ^ B C B 


If n g sputtering gas atoms impinge on the target, the total number of A and B 
atoms ejected are n g C A S A and «^C B 5 B , respectively. Therefore, the target 
surface concentration ratio is modified to 


^a _ Q\ 0 n gS\i n ) 
C& 0 ~~ n g S *l n ) 


instead of C A /C B . If S A > S B , the surface is enriched in B atoms, which 
now begin to sputter in greater profusion; i.e., 


= = S A C A (1 - n J SJn) 

Sb^B (1 — n gSB / n ) 


(3-33) 


Progressive change in the target surface composition alters the sputtered flux 
ratio to the point where it is equal to C A / C B , which is the same as the original 
target composition. Simultaneously, the target surface reaches the value 
C A / C B = C A S B /C B S A , which is maintained thereafter. A steady-state trans¬ 
fer of atoms from the bulk target to the plasma ensues, resulting in stoichio¬ 
metric film deposition. This state of affairs persists until the target is con¬ 
sumed. Conditioning of the target by sputtering a few hundred atom layers is 
required to reach steady-state conditions. As an explicit example, consider the 
deposition of Permalloy films having atomic ratio 80 Ni-20 Fe from a target 
of this same composition. For 1-keV Ar, the sputter yields are S Ni = 2.2 and 
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S Fe = 1.3. The target surface composition is altered in the steady state to 
CJ^/Cpe = 80(1.3)/20(2.2) = 2.36, which is equivalent to 70.2 Ni and 29.8 
Fe. 


3.6.4. Thermal History of the Substrate (Ref. 21) 


One of the important issues related to sputtering is the temperature rise in the 
substrate during film deposition. Sputtered atoms that impinge on the substrate 
are far more energetic than similar atoms emanating from an evaporation 
source. During condensation, this energy must be dissipated by the substrate, 
or else it may heat excessively, to the detriment of the quality of the deposited 
film. To address the question of substrate heating, we start with an equation 
describing the heat power balance, namely, 

pcd(dT/dt) = P- L. (3-34) 

The term on the left is the net thermal energy per unit area per unit time (in 
typical units of watts/cm 1 2 ) retained by a substrate whose density, heat 
capacity, effective thickness, and rate of temperature rise are given by p, c, d y 
and dT/dt y respectively. 

The incident power flux P has three important components: 


1. Heat of condensation of atoms, A H c (eV/atom). 

2. Average kinetic energy of incident adatoms, E k (eV/atom). 

3. Plasma heating from bombarding neutrals and electrons. The plasma energy 
is assumed to be E p (eV/atom). 


Table 3-5 contains values for these three energies during magnetron sputter¬ 
ing at 1 keV (Ref. 22). For a deposition rate d (A/min), 


P = 


2.67 X KT 29 </(A// c + E k 4- E p ) 


Q 


watts/cm 2 


(3-35) 


where 12 is the condensate atomic volume in cm 3 /atom. The L term represents 
the heat loss to the substrate holder by conduction or to cooler surfaces in the 
chamber by radiation. For the moment, let us neglect L and calculate the 
temperature rise of a thermally isolated substrate. Substituting Eq. 3-35 into 
Eq. 3-34 and integrating, we obtain 

/N 2.67 x \0~ 29 d(AH c + E k + E p )t 

r <'>-ss--■ (3 - 36) 

o 

Consider A1 deposited at a rate of 10,000 A/min on a Si wafer 0.050 cm 
thick. For Al, A H c + E k + E p = 13 eV/atom and Q = 16 x 10 -24 



116 


Physical Vapor Deposition 


Table 3-5. Energies Associated with Magnetron Sputttering 


Metal 

Heat of 

Condensation 

(eV/atom) 

Kinetic Energy of 
Sputtered Atoms 
(eV/atom) 

Plasma 
(eV/atom) 

Estimated 

Flux 

(eV/atom) 

Measured 

Flux 

(eV/atom) 

A1 

3.33 

6 

4 

13 

13 

Ti 

4.86 

8 

9 

22 

20 

V 

5.29 

7 

8 

20 

19 

Cr 

4.11 

8 

4 

16 

20 

Fe 

2.26 

9 

4 

15 

21 

Ni 

4.45 

11 

4 

19 

15 

Cu 

3.50 

6 

2 

12 

17 

Zr 

6.34 

13 

7 

26 

41 

Nb 

6.50 

13 

8 

28 

28 

Mo 

6.88 

13 

6 

26 

47 

Rh 

5.60 

13 

4 

23 

43 

Cd 

1.16 

4 

1 

6 

8 

In 

2.52 

4 

2 

9 

20 

Hf 

6.33 

20 

7 

33 

63 

Ta 

8.10 

21 

9 

38 

68 

W 

8.80 

22 

9 

40 

73 

Au 

3.92 

13 

2 

19 

23 


From Ref. 22. 


cm 3 /atom, and for Si, p = 2.3 g/cm 3 and c = 0.7 J/g-°C. In depositing a 
film 1 fx m thick, t = 60 sec, and the temperature rise of the substrate is 
calculated to be 162 °C. Higher deposition rates and substrates of smaller 
thermal mass will result in proportionately higher temperatures. 

The temperature will not reach values predicted by Eq. 3-36 because of L. 
For simplicity we only consider heat loss by radiation. If the front and rear 
substrate surfaces radiate to identical temperature sinks at T 0 with equal 
emissivity e, then L = 2ae(T 4 - Tq), where a, the Stefan-Boltzmann con¬ 
stant, equals 5.67 X 10” 12 W/cm 2 -K 4 . Substitution in Eq. 3-34 and direct 
integration, after separation of variables, yields 


t = 


1 

2 a 3 / 2 ^/ 2 


tan 1 



1 \fa. + \/]3 T 

\fa - -v//3 T 0 \ 

\ sfa. + i/iS T o 

4- 


(3-37) 


where a = \J(IosTq + P)/pcd and /3 = \/2oe/pcd . 
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T(°C) 



ELAPSED TIME (SEC) 

Figure 3-17. Temperature-time response for film-substrate combination under the 
influence of a power flux of 250 mW/cm 2 . Deposition rate — 1 ^m/min. (Reprinted 
with permission from Cowan Publishing Co., from L. T. Lamont, Solid State 
Technology 22(9), 107, 1979). 

Equation 3-37 expresses the time it takes for a substrate to reach temperature 
T starting from T 0 , assuming radiation cooling. For short times Eq. 3-36 
holds, whereas for longer times the temperature equilibrates to a radiation- 
limited value dependent on the incident power flux and substrate emissivity. 
Sputter deposition for most materials at the relatively high rate of 1 /xm/min 
generates a typical substrate power flux of — 250 mW/cm 2 . The predicted 
rate of film heating is shown in Fig. 3-17. If substrate bias (Section 3.7.5.) is 
also applied, temperature increases can be quite substantial. In A1 films, 
temperatures in excess of 200 °C have been measured. This partially accounts 
for enhanced atom mobility and step coverage during application of substrate 
bias. 

Finally, we briefly consider sputter etching, a process that occurs at the 
target during sputtering. Films utilized in microelectronic applications must be 
etched in order to remove material and expose patterned regions for subsequent 
film deposition or doping processes. In the VLSI regime etching is carried out 
in plasmas and reactive gas environments, where the films involved essentially 
behave like sputtering targets. At the same power level, sputter etching rates 
tend to be lower, by more than an order of magnitude, than film deposition 
rates. This means that etching requires high power levels that frequently range 
from 1 to 2 W/cm 2 . The combination of high power levels and long etching 
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times cause substrates to reach high radiation-limited temperatures. In Al, for 
example, temperature increases well in excess of 300 °C have been measured 
during etching. 


3.7. Sputtering Processes 


For convenience we divide sputtering processes into four cateogories: (1) dc, 
(2) RF, (3) magnetron, (4) reactive. We recognize, however, that there are 
important variants within each category (e.g., dc bias) and even hybrids 
between categories (e.g., reactive RF). Targets of virtually all important 
materials are commercially available for use in these sputtering processes. A 
selected number of target compositions representing the important classes 
of solids, together with typical sputtering applications for each are listed in 
Table 3-6. 

In general, the metal and alloy targets are fabricated by melting either in 
vacuum or under protective atmospheres, followed by thermomechanical pro¬ 
cessing. Refractory alloy targets (e.g., Ti-W) are hot-pressed via the powder 
metallurgy route. Similarly, nonmetallic targets are generally prepared by 
hot-pressing of powders. The elemental and metal targets tend to have purities 
of 99.99% or better, whereas those of the nonmetals are generally less pure, 
with a typical upper purity limit of 99.9%. In addition, less than theoretical 
densities are achieved during powder processing. These metallurgical realities 
are sometimes reflected in emission of particulates, release of trapped gases, 
nonuniform target erosion, and deposited films of inferior quality. Targets are 
available in a variety of shapes (e.g., disks, toroids, plates, etc.) and sizes. 
Prior to use, they must be bonded to a cooled backing plate to avoid thermal 
cracking. Metal-filled epoxy cements of high thermal conductivity are em¬ 
ployed for this purpose. 

3.7.1. DC Sputtering 

Virtually everything mentioned in the chapter so far has dealt with dc sputter¬ 
ing, also known as diode or cathodic sputtering. There is no need to further 
discuss the system configuration (Fig. 3-13), the discharge environment (Sec¬ 
tion 3.5), the ion-surface interactions (Section 3.6.1), or intrinsic sputter 
yields (Section 3.6.2). It is worthwhile, however, to note how the relative film 
deposition rate depends on the sputtering pressure and current variables. At 
low pressures, the cathode sheath is wide and ions are produced far from the 
target; their chances of being lost to the walls are great. The mean-free 
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Table 3-6. Sputtering Targets 


Material 

Application 

1. Metals 


Aluminum 

Metallization for integrated circuits, front surface mirrors 

Chromium 

Adhesion layers, resistor films (with SiO lithography 


master blanks 

Germanium 

Infrared filters 

Gold 

Contacts, reflecting films 

Iron, nickel 

Ferromagnetic films 

Palladium, platinum 

Contacts 

Silver 

Reflective films, contacts 

Tantalum 

Thin-film capacitors 

Tungsten 

Contacts 

2. Alloys 


Al-Cu, Al-Si, Al-Cu-Si Metallization for integrated circuits 

Co-Fe, Co-Ni, Fe-Tb, 

Ferromagnetic films 

Fe-Ni, Co-Ni-Cr 


Ni-Cr 

Resistors 

Ti-W 

Diffusion barriers in integrated circuits 

Gd-Co 

Magnetic bubble memory devices 

3. Oxides 


A1 2 0 3 

Insulation, protective films for mirrors 

BaTi0 3 , PbTi0 3 

Thin-film capacitors 

Ce0 2 

Antireflection coatings 

I n 2^3 _ Sn0 2 

Transparent conductors 

LiNb0 3 

Piezoelectric films 

Si0 2 

Insulation 

SiO 

Protective films for mirrors, infrared filters 

Ta 2 O s , Ti0 2 , Zr0 2 , 

Dielectric films for multilayer optical coatings 

Hfl0 2 , MgO 


Yttrium aluminum oxide 

Magnetic bubble memory devices 

(YAG), yttrium iron 


oxide (Y1G), Gd 3 Ga 5 0 12 


YV0 3 -Eu 2 0 3 

Phosphorescent coating on special currency papers 

Cu 3 Ba 2 YO ? 

High temperature superconductors 

4. Fluorides 


CaF 2 , CeF 3 , MgF 2 , 

Dielectric films for multilayer optical coatings 

ThF 4 , Na 3 AlF 6 

(antireflection coatings, filters, etc.) 

(cryolite) 


5. Borides 


TiB 2 , ZrB 2 

Hard, wear-resistant coatings 

LaBfi 

Thermionic emitters 

6. Carbides 


SiC 

High-temperature semiconduction 

TiC, TaC, WC 

Hard, wear-resistant coatings 
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Table 3-6. Continued . 



Material 

Application 

7. 

Nitrides 



Si 3 N 4 

Insulation, diffusion barriers 


TaN 

Thin-film resistors 


TiN 

Hard coatings 

8. 

Sidcides 



MoSi 2 , TaSi 2 , TiSi 2 , 
WSi 2 

Contacts, diffusion barriers in integrated circuits 

9. 

Sulfides 



CdS 

Photoconductive films 


MoS 2 , TaS 2 

Lubricant films for bearings and moving parts 


ZnS 

Multilayer optical coatings 

10. 

Selenides , tellurides 



CdSe, PbSe, CdTe 

Photoconductive films 


ZnSe, PbTe 

Optical coatings 


MoTe, MoSe 

Lubricants 



LU 


< 

CL 
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Figure 3-18. Influence of working pressure and current on deposition rate for 
nonmagnetron sputtering. (From Ref. 23). 
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electron path between collisions is large, and electrons collected by the anode 
are not replenished by ion-impact-induced cathode secondary emission. There¬ 
fore, ionization efficiencies are low, and self-sustained discharges cannot be 
maintained below about 10 mtorr. As the pressure is increased at a fixed 
voltage, the electron mean-free path is decreased, more ions are generated, and 
larger currents flow. But if the pressure is too high, the sputtered atoms 
undergo increased collisional scattering and are not efficiently deposited. The 
trade-offs in these opposing trends are shown in Fig. 3-18, and optimum 
operating conditions are shaded in. In general, the deposition rate is propor¬ 
tional to the power consumed, or to the square of the current density, and 
inversely dependent on the electrode spacing. 

3.7.2. RF Sputtering 

RF sputtering was invented as a means of depositing insulating thin films. 
Suppose we wish to produce thin Si0 2 films and attempt to use a quartz disk 
0.1 cm thick as the target in a conventional dc sputtering system. For quartz 
p » 10 16 Q-cm. To draw a current density J of 1 mA/cm 2 , the cathode needs 
a voltage V = OApJ. Substitution gives an impossibly high value of 10 12 V, 
which indicates why dc sputtering will not work. If we set a convenient level 
of V = 100 V, it means that a target with a resistivity exceeding 10 6 Q-cm 
could not be dc-sputtered. 

Now consider what happens when an ac signal is applied to the electrodes. 
Below about 50 kHz, ions are sufficiently mobile to establish a complete 
discharge at each electrode on each half-cycle. Direct current sputtering 
conditions essentially prevail at both electrodes, which alternately behave as 
cathodes and anodes. Above 50 kHz two important effects occur. Electrons 
oscillating in the glow region acquire enough energy to cause ionizing colli¬ 
sions, reducing the need for secondary electrons to sustain the discharge. 
Secondly, RF voltages can be coupled through any kind of impedance so that 
the electrodes need not be conductors. This makes it possible to sputter any 
material irrespective of its resistivity. Typical RF frequencies employed range 
from 5 to 30 MHz. However, 13.56 MHz has been reserved for plasma 
processing by the Federal Communications Commission and is widely used. 

RF sputtering essentially works because the target self-biases to a negative 
potential. Once this happens, it behaves like a dc target where positive ion 
bombardment sputters away atoms for subsequent deposition. Negative target 
bias is a consequence of the fact that electrons are considerably more mobile 
than ions and have little difficulty in following the periodic change in the 
electric field. In Fig. 3-13b we depict an RF sputtering system schematically. 
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(b) 

Figure 3-19. Formation of pulsating negative sheath on capacitively coupled cathode 
of RF discharge (a) Net current/zero self-bias voltage, (b) Zero current/nonzero 
self-bias voltage. (From Ref. 4). 


where the target is capacitively coupled to the RF generator. The disparity in 
electron and ion mobilities means that isolated positively charged electrodes 
draw more electron current than comparably isolated negatively charged 
electrodes draw positive ion current. For this reason the discharge current- 
voltage characteristics are asymmetric and resemble those of a leaky rectifier 
or diode. This is indicated in Fig. 3-19, and even though it applies to a dc 
discharge, it helps to explain the concept of self-bias at RF electrodes. 

As the pulsating RF signal is applied to the target, a large initial electron 
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current is drawn during the positive half of the cycle. However, only a small 
ion current flows during the second half of the cycle. This would enable a net 
current averaged over a complete cycle to be different from zero; but this 
cannot happen because no charge can be transferred through the capacitor. 
Therefore, the operating point on the characteristic shifts to a negative voltage 
—the target bias—and no net current flows. 

The astute reader will realize that since ac electricity is involved, both 
electrodes should sputter. This presents a potential problem because the 
resultant film may be contaminated as a consequence. For sputtering from only 
one electrode, the sputter target must be an insulator and be capacitively 
coupled to the RF generator. The equivalent circuit of the sputtering system 
can be thought of as two series capacitors—one at the target sheath region, the 
other at the substrate—with the applied voltage divided between them. Since 
capacitive reactance is inversely proportional to the capacitance or area, more 
voltage will be dropped across the capacitor of a smaller surface area. 
Therefore, for efficient sputtering the area of the target electrode should be 
small compared with the total area of the other, or directly coupled, electrode. 
In practice, this electrode consists of the substrate stage and system ground, 
but it also includes baseplates, chamber walls, etc. It has been shown that the 
ratio of the voltage across the sheath at the (small) capacitively coupled 
electrode ( V c ) to that across the (large) directly coupled electrode V d is given 
by (Ref. 24) 

VJV d = (A d /A c )\ (3-38) 

where A c and A d are the respective electrode areas. In essence, a steady-state 
voltage distribution prevails across the system similar to that shown in Fig. 
3-15. The fourth-power dependence means that a large value of A d is very 
effective in raising the target sheath potential while minimizing ion bombard¬ 
ment of grounded fixtures. 

3.7.3. Magnetron Sputtering 

3.7. 3.1. Electron Motion in Parallel Electric and Magnetic Fields . Let 

us now examine what happens when a magnetic field of strength B is 
superimposed on the electric field S between the target and substrate. Such a 
situation arises in magnetron sputtering as well as in certain plasma etching 
configurations. Electrons within the dual field environment experience the 
well-known Lorentz force in addition to electric field force, i.e., 

mdv 

F- — = -q(* + v x B), (3-39) 

dt 

where q y m and v are the electron charge, mass, and velocity, respectively. 
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- e 

I -►B I 

Figure 3-20. Effect of $ and B on electron motion, (a) Linear electron trajectory 
when £ II B (6 = 0); (b) helical orbit of constant pitch when B =£ 0, $ = 0, (0 =£ 0); 
(c) helical orbit of variable pitch when <f II B (6 =£ 0). 


First consider the case where B and $ are parallel as shown in Fig. 3-20a. 
When electrons are emitted exactly normal to the target surface and parallel to 
both fields, then v X B vanishes; electrons are only influenced by the $ field, 
which accelerates them toward the anode. Next consider the case where the $ 
field is neglected but B is still applied as shown in Fig. 3-20b. If an electron is 
launched from the cathode with velocity v at angle 0 with respect to B, it 
experiences a force gvB sin 0 in a direction perpendicular to B. The electron 
now orbits in a circular motion with a radius r that is determined by a balance 
of the centrifugal (m(v sin 0) 2 /r) and Lorentz forces involved, i.e., r = 
mv sin 0 / qB. The electron motion is helical; in corkscrew fashion it spirals 
down the axis of the discharge with constant velocity v cos 0. If the magnetic 
field were not present, such off-axis electrons would tend to migrate out of the 
discharge and be lost at the walls. 

The case where electrons are launched at an angle to parallel, uniform $ 
and B fields is somewhat more complex. Corkscrew motion with constant 
radius occurs, but because of electron acceleration in the $ field, the pitch of 
the helix lengthens with time (Fig. 3-20c). Time varying S fields complicate 
matters further and electron spirals of variable radius can occur. Clearly, 



3.7 Sputtering Processes 


125 


magnetic fields prolong the electron residence time in the plasma and thus 
enhance the probability of ion collisions. This leads to larger discharge 
currents and increased sputter deposition rates. Comparable discharges in a 
simple diode-sputtering configuration operate at higher currents and pressures. 
Therefore, applied magnetic fields have the desirable effect of reducing 
electron bombardment of substrates and extending the operating vacuum range. 


3.7.3.2. Perpendicular Electric and Magnetic Fields . In magnetrons, 
electrons ideally do not even reach the anode but are trapped near the target, 
enhancing the ionizing efficiency there. This is accomplished by employing a 
magnetic field oriented parallel to the target and perpendicular to the electric 
field, as shown schematically in Fig. 3-21. Practically, this is achieved by 
placing bar or horseshoe magnets behind the target. Therefore, the magnetic 
field lines first emanate normal to the target, then bend with a component 
parallel to the target surface (this is the magnetron component) and finally 
return, completing the magnetic circuit. Electrons emitted from the cathode are 
initially accelerated toward the anode, executing a helical motion in the 
process; but when they encounter the region of the parallel magnetic field, they 
are bent in an orbit back to the target in very much the same way that electrons 
are deflected toward the hearth in an e-gun evaporator. By solving the coupled 
differential equations resulting from the three components of Eq. 3-39, we 
readily see that the parameric equations of motion are 


y = 



COS 0> c l), 


X 


ill sin o) c t \ 


(3-40a) 

(340b) 



Figure 3-21. Applied fields and electron motion in the planar magnetron. 
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where y and x are the distances above and along the target, and w c = qB/m. 
These equations describe a cycloidal motion that the electrons execute within 
the cathode dark space where both fields are present. If, however, electrons 
stray into the negative glow region where the @ field is small, the electrons 
describe a circular motion before collisions may drive them back into the dark 
space or forward toward the anode. By suitable orientation of target magnets, a 
“race track” can be defined where the electrons hop around at high speed. 
Target erosion by sputtering occurs within this track because ionization of the 
working gas is most intense above it. 

Magnetron sputtering is presently the most widely commercially practiced 
sputtering method. The chief reason for its success is the high deposition rates 
achieved (e.g., up to 1 ^m/min for Al). These are typically an order of 
magnitude higher than rates attained by conventional sputtering techniques. 
Popular sputtering configurations utilize planar, toroidal (rectangular cross 
section), and toroidal-conical (trapezoidal cross section) targets (i.e., the 
S-gun). In commercial planar magnetron sputtering systems, the substrate 
plane translates past the parallel facing target through interlocked vacuum 
chambers to allow for semicontinuous coating operations. The circular 
(toridal-conical) target, on the other hand, is positioned centrally within the 
chamber, creating a deposition geometry approximating that of the analogous 
planar (ring) evaporation source. In this manner wafers on a planetary sub¬ 
strate holder can be coated as uniformly as with e-gun sources. 

3.7.4. Reactive Sputtering 

In reactive sputtering, thin films of compounds are deposited on substrates by 
sputtering from metallic targets in the presence of a reactive gas, usually mixed 
with the inert working gas (invariably Ar). The most common compounds 
reactively sputtered (and the reactive gases employed) are briefly listed: 

1. Oxides (oxygen)—A1 2 0 3 , ln 2 0 3 , Sn0 2 , Si0 2 , Ta 2 O s 

2. Nitrides (nitrogen, ammonia)—TaN, TiN, AIN, Si 3 N 4 

3. Carbides (methane, acetylene, propane)—TiC, WC, SiC 

4. Sulfides (H 2 S)—CdS, CuS, ZnS 

5. Oxycarbides and oxynitrides of Ti, Ta, Al, and Si 

Irrespective of which of these materials is considered, during reactive 
sputtering the resulting film is either a solid solution alloy of the target metal 
doped with the reactive element (e.g., TaN 001 ), a compound (e.g., TiN), or 
some mixture of the two. Westwood (Ref. 25) has provided a useful way to 
visualize the conditions required to yield alloys or compounds. These two 
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a. 



REACTIVE GAS FLOW 

b. 

Figure 3-22. (a) Generic hysteresis curve for system pressure vs. reactive gas flow 

rate during reactive sputtering. Dotted line represents behavior with inert gas. (From 
Ref. 25). (b) Hysteresis curve of cathode voltage vs. reactive gas flow rate at constant 
discharge current. 

regimes are distinguished in Fig. 3-22a, illustrating the generic hysteresis 
curve for the total system pressure (P) as a function of the flow rate of 
reactive gas ( Q r ) into the system. First, however, consider the dotted line 
representing the variation of P with flow rate of an inert sputtering gas (Q ; ). 
Clearly, as Q x increases, P increases because of the constant pumping speed 
(see Eq. 2-16). An example of this characteristic occurs during Ar gas 
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sputtering of Ta. Now consider what happens when reactive N 2 gas is 
introduced into the system. As Q r increases from Q r (0), the system pressure 
essentially remains at the initial value P 0 because N 2 reacts with Ta and is 
removed from the gas phase. But beyond a critical flow rate Q*, the system 
pressure jumps to the new value P x . If no reactive sputtering took place, P 
would be somewhat higher (i.e., P 3 ). Once the equilibrium value of P is 
established, subsequent changes in Q r cause P to increase or decrease linearly 
as shown. As Q r decreases sufficiently, P again reaches the initial pressure. 

The hysteresis behavior represents two stable states of the system with a 
rapid transition between them. In state A there is little change in pressure, 
while for state B the pressure varies linearly with Q r . Clearly, all of the 
reactive gas is incorporated into the deposited film in state A—the doped metal 
and the atomic ratio of reactive gas dopant to sputtered metal increases with 
Q r . The transition from state A to state B is triggered by compound formation 
on the metal target. Since ion-induced secondary electron emission is usually 
much higher for compounds than for metals, Ohm’s law suggests that the 
plasma impedance is effectively lower in state B than in state A. This effect is 
reflected in the hysteresis of the target voltage with reactive gas flow rate, as 
schematically depicted in Fig. 3-22b. 

The choice of whether to employ compound targets and sputter directly or 
sputter reactively is not always clear. If reactive sputtering is selected, then 
there is the option of using simple dc diode, RF, or magnetron configurations. 
Many considerations go into making these choices, and we will address some 
of them in turn. 

3 . 7 . 4 . 7 . Target Purity . It is easier to manufacture high-purity metal targets 
than to make high-purity compound targets. Since hot pressed and sintered 
compound powders cannot be consolidated to theoretical bulk densities, incor¬ 
poration of gases, porosity, and impurities is unavoidable. Film purity using 
elemental targets is high, particularly since high-purity reactive gases are 
commercially available. 

3 . 7 . 4 . 2 . Deposition Rates . Sputter rates of metals drop dramatically when 
compounds form on the targets. Decreases in deposition rate well in excess of 
50% occur because of the lower sputter yield of compounds relative to metals. 
The effect is very much dependent on reactive gas pressure. In dc discharges, 
sputtering is effectively halted at very high gas pressures, but the limits are 
also influenced by the applied power. Conditioning of the target in pure Ar is 
required to restore the pure metal surface and desired deposition rates. Where 
high deposition rates are a necessity, the reactive sputtering mode of choice is 
either dc or RF magnetron. 
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h- 

Figure 3-23. Influence of nitrogen on composition, resistivity, and temperature 
coefficient of resistivity of Ta films. (From Ref 26). 


3.7.4.3. Stoichiometry and Properties . Considerable variation in the 
composition and properties of reactively sputtered films is possible, depending 
on operating conditions. The case of tantalum nitride is worth considering in 
this regard. One of the first electronic applications of reactive sputtering 
involved deposition of TaN resistors employing dc diode sputtering at voltages 
of 3-5 kV, and pressures of about 30 x 10~ 3 torr. The dependence of the 
resistivity of “tantalum nitride” films is shown in Fig. 3-23, where either Ta, 
Ta 2 N, TaN, or combinations of these form as a function of N 2 partial 
pressure. Color changes accompany the varied film stoichiometries. For 
example, in the case of titanium nitride films, the metallic color of Ti gives 
way to a light gold, then a rose, and finally a brown color with increasing 
nitrogen partial pressure. 


3.7.5. Bias Sputtering 

In bias sputtering, electric fields near the substrate are modified in order to 
vary the flux and energy of incident charged species. This is achieved by 
applying either a negative dc or RF bias to the substrate. With target voltages 
of - 1000 to -3000 V, bias voltages of -50 to -300 V are typically used. 
Due to charge exchange processes in the anode dark space, very few discharge 
ions strike the substrate with full bias voltage. Rather a broad low energy 
distribution of ions and neutrals bombard the growing film. The technique has 
been utilized in all sputtering configurations (dc, RF, magnetron, and reactive). 
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SUBSTRATE BIAS (-VOLTS) 

o 

Figure 3-24. Resistivity of Ta films vs. substrate bias voltage; dc bias (3000 A 

thick). (From Ref. 27). RF bias (1600 A thick). (From Ref. 28). 

Bias sputtering has been effective in altering a broad range of properties in 

deposited films. As specific examples we cite (Refs. 4-6). 

a. Resistivity— A significant reduction in resistivity has been observed in 
metal films such as Ta, W, Ni, Au, and Cr. The similar variation in Ta 
film resistivity with dc or RF bias shown in Fig. 3-24 suggests that a 
common mechanism, independent of sputtering mode, is operative. 

b. Hardness and Residual Stress— The hardness of sputtered Cr has been 
shown to increase (or decrease) with magnitude of negative bias voltage 
applied. Residual stress is similarly affected by bias sputtering. 

c. Dielectric Properties— Increasing RF bias during RF sputtering of Si0 2 
films has resulted in decreases in relative dielectric constant, but increases 
in resistivity. 

d. Etch Rate —The wet chemical etch rate of reactively sputtered silicon 
nitride films is reduced with increasing negative bias. 

e. Optical Reflectivity— Unbiased films of W, Ni, and Fe appear dark gray 
or black, whereas bias-sputtered films display metallic luster. 

f. Step Coverage—Substantial improvement in step coverage of A1 accompa¬ 
nies application of dc substrate bias. 
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g. Film morphology —The columnar microstructure of RF-sputtered Cr is 
totally disrupted by ion bombardment and replaced instead by a compacted, 
fine-grained structure (Ref. 18). 

h. Density —Increased film density has been observed in bias-sputtered Cr 
(Ref. 18). Lower pinhole porosity and corrosion resistance are manifesta¬ 
tions of the enhanced density. 

i. Adhesion —Film adhesion is normally improved with ion bombardment of 
substrates during initial stages of film formation. 

Although the details are not always clearly understood, there is little doubt 
that bias controls the film gas content. For example, chamber gases (e.g., Ar, 
0 2 , N 2 , etc.) sorbed on the growing film surface may be resputtered during 
low-energy ion bombardment. In such cases both weakly bound physisorbed 
gases (e.g., Ar) or strongly attached chemisorbed species (e.g., O or N on Ta) 
apparently have large sputtering yields and low sputter threshold voltages. In 
other cases, sorbed gases may have anomalously low sputter yields and will be 
incorporated within the growing film. In addition, energetic particle bombard¬ 
ment prior to and during film formation and growth promotes numerous 
changes and processes at a microscopic level, including removal of contami¬ 
nants, alteration of surface chemistry, enhancement of nucleation and renucle- 
ation (due to generation of nucleation sites via defects, implanted, and recoil- 
implanted species), higher surface mobility of adatoms, and elevated film 
temperatures with attendant acceleration of atomic reaction and interdiffusion 
rates. Film properties are then modified through roughening of the surface, 
elimination of interfacial voids and subsurface porosity, creation 
of a finer, more isotropic grain morphology, and elimination of columnar 
grains—in a way that strongly dramatizes structure-property relationships in 
practice. 

There are few ways to broadly influence such a wide variety of thin-film 
properties, in so simple and cheap a manner, than by application of substrate 
bias. 


3.7.6. Evaporation versus Sputtering 

Now that the details of evaporation and sputtering have been presented, we 
compare their characteristics with respect to process variables and resulting 
film properties. Distinctions in the stages of vapor species production, trans¬ 
port through the gas phase, and condensation on substrate surfaces for the two 
PVD processes are reviewed in tabular form in Table 3-7. 
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Table 3-7. Evaporation versus Sputtering 


Evaporation 


Sputtering 

A. Production of Vapor Species 

1. Thermal evaporation mechanism 

1. 

Ion bombardment and collisional 

momentum transfer 

2. Low kinetic energy of evaporant 

2. 

High kinetic energy of sputtered 

atoms (at 1200 K, E = 0.1 eV) 


atoms ( E - 2-30 eV) 

3. Evaporation rate (Eq. 3-2) (for 

3. 

Sputter rate (at 1 mA/cm 2 and 

M = 50, T = 1500 K, and P e = lO’ 3 ) 

= 1.3 x 10 17 atoms/cm 2 -sec. 


S = 2) » 3 X 10 16 atoms/cm 2 -sec 

4. Directional evaporation according 

4. 

Directional sputtering according to 

to cosine law 


cosine law at high sputter rates 

5. Fractionation of multicomponent 

5. 

Generally good maintenance of target 

alloys, decomposition, and 


stoichiometry, but some 

dissociation of compounds 


dissociation of compounds. 

6. Availability of high evaporation 

6. 

Sputter targets of all materials 

source purities 


are available; purity varies with 
material 

B. The Gas Phase 

1. Evaporant atoms travel in high or 

1. 

Sputtered atoms encounter high- 

ultrahigh vacuum (~ 10 _ 6 -10 — 10 


pressure discharge region 

torr) ambient 


(~ 100 mtorr) 

2. Thermal velocity of evaporant 

2. 

Neutral atom velocity ~5x 10 4 

10 5 cm/sec 


cm/sec 

3. Mean-free path is larger than 

3. 

Mean-free path is less than target- 

evaporant-substrate spacing. 


substrate spacing. Sputtered atoms 

Evaporant atoms undergo no 


undergo many collisions in the 

collisions in vacuum 


discharge 

C. The Condensed 

Film 

1. Condensing atoms have relatively 
low energy 

1. 

Condensing atoms have high energy 

2. Low gas incorporation 

2. 

Some gas incorporation 

3. Grain size generally larger than 
for sputtered Film 

3. 

Good adhesion to substrate 

4. Few grain orientations (textured 
films) 

4. 

Many grain orientations 

3.8. Hybrid and Modified PVD Processes 


This chapter concludes with a discussion of several PVD processes that are 
more complex than the conventional ones considered up to this point. They 
demonstrate the diversity of process hybridization and modification possible in 
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producing films with unusual properties. Ion plating, reactive evaporation, and 
ion-beam-assisted deposition will be the processes considered first. In the first 
two, the material deposited usually originates from a heated evaporation 
source. In the third, well-characterized ion beams bombard films deposited by 
evaporation or sputtering. The chapter closes with a discussion of ionized 
cluster-beam deposition. This process is different from others considered in 
this chapter in that film formation occurs through impingement of collective 
groups of atoms from the gas phase rather than individual atoms. 

3.8.1. Ion Plating 

Ion plating, developed by Mattox (Ref. 29), refers to evaporated film deposi¬ 
tion processes in which the substrate is exposed to a flux of high-energy ions 
capable of causing appreciable sputtering before and during film formation. A 
schematic representation of a diode-type batch, ion-plating system is shown in 
Fig. 3-25a. Since it is a hybrid system, provision must be made to sustain the 
plasma, cause sputtering, and heat the vapor source. Prior to deposition, the 
substrate, negatively biased from 2 to 5 kV, is subjected to inert-gas ion 
bombardment at a pressure in the millitorr range for a time sufficient to 
sputter-clean the surface and remove contaminants. Source evaporation is then 
begun without interrupting the sputtering, whose rate must obviously be less 
than that of the deposition rate. Once the interface between film and substrate 
has formed, ion bombardment may or may not be continued. To circumvent 
the relatively high system pressures associated with glow discharges, high- 
vacuum ion-plating systems have also been constructed. They rely on directed 
ion beams targeted at the substrate. Such systems, which have been limited 
thus far to research applications, are discussed in Section 3.8.3. 

Perhaps the chief advantage of ion plating is the ability to promote extremely 
good adhesion between the film and substrate by the ion and particle bombard¬ 
ment mechanisms discussed in Section 3.7.5. A second important advantage is 
the high “throwing power” when compared with vacuum evaporation. This 
results from gas scattering, entrainment, and sputtering of the film, and 
enables deposition in recesses and on areas remote from the source-substrate 
line of sight. Relatively uniform coating of substrates with complex shapes is 
thus achieved. Lastly, the quality of deposited films is frequently enhanced. 
The continual bombardment of the growing film by high-energy ions or neutral 
atoms and molecules serves to peen and compact it to near bulk densities. 
Sputtering of loosely adhering film material, increased surface diffusion, and 
reduced shadowing effects serve to suppress undesirable columnar growth. 



CATHODE DARK SPACE 



(a) (b) 

Figure 3-25. Hybrid PVD process: (a) Ion plating. (From Ref. 29). (b) Activated reactive evaporation. (From Ref. 30). (c) 
Ion-beam-assisted deposition. (From Ref. 31). 


134 Physical Vapor Deposition 








3.8 Hybrid and Modified PVD Processes 


135 



Figure 3-25. Continued. 


A major use of ion plating has been to coat steel and other metals with very 
hard films for use in tools and wear-resistant applications. For this purpose, 
metals like Ti, Zr, Cr, and Si are electron-beam-evaporated through an Ar 
plasma in the presence of reactive gases such as N 2 ,0 2 , and CH 4 , which are 
simultaneously introduced into the system. This variant of the process is 
known as reactive ion plating (RIP), and coatings of nitrides, oxides, and 
carbides have been deposited in this manner. 


3.8.2. Reactive Evaporation Processes 

In reactive evaporation the evaporant metal vapor flux passes through and 
reacts with a gas (at 1-30 X 10~ 3 torr) introduced into the system to produce 
compound deposits. The process has a history of evolution in which evapora¬ 
tion was first carried out without ionization of the reactive gas. In the more 
recent activated reactive evaporation (ARE) processes developed by Bunshah 
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and co-workers (Ref. 30), a plasma discharge is maintained directly within the 
reaction zone between the metal source and substrate. Both the metal vapor 
and reactive gases, such as 0 2 , N 2 , CH 4 , C 2 H 2 , etc., are, therefore, ionized 
increasing their reactivity on the surface of the growing film or coating, 
promoting stoichiometric compound formation. One of the process configura¬ 
tions is illustrated in Fig. 3-25b, where the metal is melted by an electron 
beam. A thin plasma sheath develops on top of the molten pool. Low-energy 
secondary electrons from this source are drawn upward into the reaction zone 
by a circular wire electrode placed above the melt biased to a positive dc 
potential (20-100 V), creating a plasma-filled region extending from the 
electron-beam gun to near the substrate. The ARE process is endowed with 
considerable flexibility, since the substrates can be grounded, allowed to float 
electrically, or biased positively or negatively. In the latter variant ARE is 
quite similar to RIP. Other modifications of ARE include resistance-heated 
evaporant sources coupled with a low-voltage cathode (electron) emitter-anode 
assembly. Activation by dc and RF excitation has also been employed to 
sustain the plasma, and transverse magnetic fields have been applied to 
effectively extend plasma electron lifetimes. 

Before considering the variety of compounds produced by ARE, we recall 
that thermodynamic and kinetic factors are involved in their formation. The 
high negative enthalpies of compound formation of oxides, nitrides, carbides, 
and borides indicate no thermodynamic obstacles to chemical reaction. The 
rate-controlling step in simple reactive evaporation is frequently the speed of 
the chemical reaction at the reaction interface. The actual physical location of 
the latter may be the substrate surface, the gas phase, the surface of the metal 
evaporant pool, or a combination of these. Plasma activation generally lowers 
the energy barrier for reaction by creating many excited chemical species. By 
eliminating the major impediment to reaction, ARE processes are thus capable 
of deposition rates of a few thousand angstroms per minute. 

A partial list of compounds synthesized by ARE methods includes the oxides 
c*A1 2 0 3 , V 2 0 3 , Ti0 2 , indium-tin oxide; the carbides TiC, ZrC, NbC, Ta 2 C, 
W 2 C, VC, HfC; and the nitrides TiN, MoN, HfN, and cubic boron nitride. 
The extremely hard TiN, TiC, A1 2 0 3 , and HfN compounds have found 
extensive use as coatings for sintered carbide cutting tools, high-speed drills, 
and gear cutters. As a result, they considerably increase wear resistance and 
extend tool life. In these applications ARE processing competes with the CVD 
methods discussed in Chapters 4 and 12. The fact that no volatile metal-bearing 
compound is required as in CVD is an attractive advantage of ARE. Most 
significantly, these complex compound films are synthesized at relatively low 
temperatures; this is a unique feature of plasma-assisted deposition processes. 
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3.8.3. lon-Beam-Assisted Deposition Processes (Ref. 31) 

We noted in Section 3.7.5 that ion bombardment of biased substrates during 
sputtering is a particularly effective way to modify film properties. Process 
control in plasmas is somewhat haphazard, however, because the direction, 
energy, and flux of the ions incident on the growing film cannot be regulated. 
Ion-beam-assisted processes were invented to provide independent control of 
the deposition parameters and, particularly, the characteristics of the ions 
bombarding the substrate. Two main ion source configurations are employed. 
In the dual-ion-beam system, one source provides the inert or reactive ion 
beam to sputter a target in order to yield a flux of atoms for deposition onto 
the substrate. Simultaneously, the second ion source, aimed at the substrate, 
supplies the inert or reactive ion beam that bombards the depositing film. 
Separate film-thickness-rate and ion-current monitors, fixed to the substrate 
holder, enable the two incident beam fluxes to be independently controlled. 

In the second configuration (Fig. 3-25c), an ion source is used in conjunc¬ 
tion with an evaporation source. The process, known as ion-assisted deposi¬ 
tion (IAD), combines the benefits of high film deposition rate and ion 
bombardment. The energy flux and direction of the ion beam can be regulated 
independently of the evaporation flux. In both configurations the ion-beam 
angle of incidence is not normal to the substrate and can lead to anisotropic 
film properties. Substrate rotation is, therefore, recommended if isotropy is 
desired. 

Broad-beam (Kaufman) ion sources, the heart of ion-beam-assisted deposi¬ 
tion systems, were first used as ion thrusters for space propulsion (Ref. 32). 
Their efficiency has been optimized to yield high-ion-beam fluxes for given 
power inputs and gas flows. They contain a discharge chamber that is raised to 
a potential corresponding to the desired ion energy. Gases fed into the chamber 
become ionized in the plasma, and a beam of ions is extracted and accelerated 
through matching apertures in a pair of grids. Current densities of several 
mA/cm 2 are achieved. (Note that 1 mA/cm 2 is equivalent to 6.25 x 10 15 
ions/cm 2 -sec or several monolayers per second.) The resulting beams have a 
low-energy spread (typically 10 eV) and are well collimated, with divergence 
angles of only a few degrees. Furthermore, the background pressure is quite 
low (— 10~ 4 torr) compared with typical sputtering or etching plasmas. 

Examples of thin-film property modification as a result of IAD are given in 
Table 3-8. The reader should appreciate the applicability to all classes of solids 
and to a broad spectrum of properties. For the most part, ion energies are 
lower than those typically involved in sputtering. Bombarding ion fluxes are 
generally smaller than depositing atom fluxes. Perhaps the most promising 
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Table 3-8. Property Modification by Ion Bombardment during Film Deposition 


Film 

material 

Ion 

species 

Property 

modified 

Ion 

energy 

(eV) 

Ion/Atom 

Arrival 

Rate Ratio 

Ge 

Ar + 

Stress, 

adhesion 

65-3000 

2 x 10“ 4 to 

10“ 1 

Nb 

Ar + 

Stress 

100-400 

3 x 10 2 

Cr 

Ar + , Xe + 

Stress 

3,400-11,500 

8 x 10“ 3 to 

4 x 10“ 2 

Or 

Ar + 

Stress 

200-800 

- 7 x 10~ 3 to 

2 x 10~ 2 

Si0 2 

Ar + 

Step 

coverage 

500 

0.3 

Si0 2 

Ar + 

Step 

coverge 

- 1-80 

- 4.0 

AIN 

n 2 + 

Preferred 

orientation 

300-500 

0.96 to 

1.5 

Au 

Ar + 

Coverage at 

o 

50 A thickness 

400 

0.1 

GdCoMo 

Ar + 

Magnetic 

anisotropy 

- 1-150 

- 0.1 

Cu 

Cu + 

Improved 

epitaxy 

50-400 

10^ 2 

BN 

(B-N-H) + 

Cubic 

structure 

200-1000 

- 1.0 

Zr0 2 , 

Si0 2 , Ti0 2 

Ar + , 0 2 + 

Refractive 

index, 

amor -*■ crys 

600 

2.5 x 10“ 2 to 
10“' 

Si0 2 , Ti0 2 

o 2 + 

Refractive 

index 

300 

0.12 

Si0 2 , Ti0 2 

0 2 + 

Optical 

transmission 

30-500 

0.05 to 

0.25 

Cu 

N + , Ar + 

Adhesion 

50,000 

io - 2 

Ni on Fe 

Ar + 

Hardness 

10,000-20,000 

- 0.25 


From Ref. 32. 


application of ion bombardment is the enhancement of the density and index of 
refraction of optical coatings. This subject is treated again in Chapter 11. 

3.8.4. Ionized Cluster Beam (ICB) Deposition (Ref. 33) 

The idea of employing energetic ionized clusters of atoms to deposit thin films 
is due to T. Takagi. In this novel technique, vapor-phase aggregates or 
clusters, thought to contain a few hundred to a few thousand atoms, are 
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SUBSTRATE 



Figure 3-26. Schematic diagram of ICB system. (Courtesy of W. L. Brown, AT&T 
Bell Laboratories. Reprinted with permission of the publisher from Ref. 34). 

created, ionized, and accelerated toward the substrate as depicted schematically 
in Fig. 3-26. As a result of impact with the substrate, the cluster breaks apart, 
releasing atoms to spread across the surface. Cluster production is, of course, 
the critical step and begins with evaporation from a crucible containing a small 
aperture or nozzle. The evaporant vapor pressure is much higher (10 -2 -10 
torr) than in conventional vacuum evaporation. For cluster formation the 
nozzle diameter must exceed the mean-free path of vapor atoms in the crucible. 
Viscous flow of atoms escaping the nozzle then results in an adiabatic 
supersonic expansion and the formation of stable cluster nuclei. Optimum 
expansion further requires that the ratio of the vapor pressure in the crucible to 
that in the vacuum chamber exceed 10 4 to 10 5 . 

The arrival of ionized clusters with the kinetic energy of the acceleration 
voltage (0-10 kV), and neutral clusters with the kinetic energy of the nozzle 
ejection velocity, affects film nucleation and growth processes in the following 
ways: 

1. The local temperature at the point of impact increases. 

2. Surface diffusion of atoms is enhanced. 
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3. Activated centers for nucleation are created. 

4. Coalescence of nuclei is fostered. 

5. At high enough energies, the surface is sputter-cleaned, and shallow 
implantation of ions may occur. 

6. Chemical reactions between condensing atoms and the substrate or gas-phase 
atoms are favored. 

Moreover, the magnitude of these effects can be modified by altering the 
extent of electron impact ionization and the accelerating voltage. 

Virtually all classes of film materials have been deposited by ICB (and 
variant reactive process versions), including pure metals, alloys, intermetallic 
compounds, semiconductors, oxides, nitrides, carbides, halides, and organic 
compounds. Special attributes of ICB-prepared films worth noting are strong 
adhesion to the substrate, smooth surfaces, elimination of columnar growth 
morphology, low-temperature growth, controllable crystal structures, and, 
importantly, very high quality single-crystal growth (epitaxial films). Large Au 
film mirrors for C0 2 lasers, ohmic metal contacts to Si and GaP, electromigra¬ 
tion- (Section 8.4) resistant A1 films, and epitaxial Si, GaAs, GaP, and InSb 
films deposited at low temperatures are some examples indicative of the 
excellent properties of ICB films. Among the advantages of ICB deposition are 
vacuum cleanliness (~ 10” 7 torr in the chamber) of evaporation and energetic 
ion bombardment of the substrate, two normally mutually exclusive features. 
In addition, the interaction of slowly moving clusters with the substrate is 
confined, limiting the amount of damage to both the growing film and 
substrate. Despite the attractive features of ICB, the formation of clusters and 
their role in film formation are not well understood. Recent research (Ref. 34), 
however, clearly indicates that the total number of atoms agglomerated in large 
metal clusters is actually very small (only 1 in 10 4 ) and that only a fraction of 
large clusters is ionized. The total energy brought to the film surface by 
ionized clusters is, therefore, quite small. Rather, it appears that individual 
atomic ions, which are present in much greater profusion than are ionized 
clusters, are the dominant vehicle for transporting energy and momentum to 
the growing film. In this respect, ICB deposition belongs to the class of 
processes deriving benefits from the ion-beam-assisted film growth mecha¬ 
nisms previously discussed. 



Exercises 



1. Employing Figs. 3-1 and 3-2, calculate values for the molar heat of 
vaporization of Si and Ga. 
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2 . Design a laboratory experiment to determine a working value of the heat 
of vaporization of a metal employing common thin-film deposition and 
characterization equipment. 

3 . Suppose Fe satisfactorily evaporates from a surface source, 1 cm 2 in 
area, which is maintained at 1550 °C. Higher desired evaporation rates 
are achieved by raising the temperature 100 °C. But doing this will bum 
out the source. Instead, the melt area is increased without raising its 
temperature. By what factor should the source area be enlarged? 

4 - A molecular-beam epitaxy system contains separate A1 and As effusion 
evaporation sources of 4 cm 2 area, located 10 cm from a (100) GaAs 
substrate. The A1 source is heated to 1000 °C, and the As source is 
heated to 300 °C. What is the growth rate of the AlAs film in A/sec? 
[Note: AlAs basically has the same crystal structure and lattice parameter 
(5.661 A) as GaAs.] 

5 - How far from the substrate, in illustrative problem on p. 90, would a 
single surface source have to be located to maintain the same deposited 
film thickness tolerance? 

6 - An A1 film was deposited at a rate of 1 /*m/min in vacuum at 25 °C, and 
it was estimated that the oxygen content of the film was 10 ~ 3 . What was 
the partial pressure of oxygen in the system? 

7 , Alloy films of Ti-W, used as diffusion barriers in integrated circuits, are 
usually sputtered. The Ti-W, phase diagram resembles that of Ge-Si 
(Fig. 1-13) at elevated temperatures. 

a. Comment on the ease or feasibility of evaporating a 15 wt% Ti-W 
alloy. 

b. During sputtering with 0.5-keV Ar, what composition will the target 
surface assume in the steady state? 

8 - In order to deposit films of the alloy YBa 2 Cu 3 , the metals Y, Ba, and Cu 
are evaporated from three point sources. The latter are situated at the 
comers of an equilateral triangle whose side is 20 cm. Directly above the 
centroid of the source array, and parallel to it, lies a small substrate; the 
deposition system geometry is thus a tetrahedron, each side being 20 cm 
long. 

a. If the Y source is heated to 1740 K to produce a vapor pressure of 
10 3 torr, to what temperature must the Cu source be heated to 
maintain film stoichiometry? 
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b. Rather than a point source, a surface source is used to evaporate Cu. 
How must the Cu source temperature be changed to ensure deposit 
stoichiometry? 

c. If the source configuration in part (a) is employed, what minimum 0 2 
partial pressure is required to deposit stoichiometric YBa 2 Cu 3 0 7 
superconducting films by a reactive evaporation process? The atomic 
weights are Y = 89, Cu = 63.5, Ba = 137, and O = 16. 

9 . One way to deposit a thin metal film of known thickness is to heat an 
evaporation source to dryness (i.e., until no metal remains in the crucible). 

o 

Suppose it is desired to deposit 5000 A of Au on the internal spherical 
surface of a hemispherical shell measuring 30 cm in diameter. 

a. Suggest two different evaporation source configurations (source type 
and placement) that would yield uniform coatings. 

b. What weight of Au would be required for each configuration, assum¬ 
ing evaporation to dryness? 

10 - Suppose the processes of electron impact ionization and secondary emis¬ 
sion of electrons by ions control the current J in a sputtering system 
according to the Townsend equation (Ref. 19) 

/ 0 exp ad 

1 - 7 [exp(ad/) - l] 

where J 0 = primary electron current density from external source 
a = number of ions per unit length produced by electrons 
7 = number of secondary electrons emitted per incident ion 
d = interelectrode spacing. 

a. If the film deposition rate during sputtering is proportional to the 
product of J and S, calculate the proportionality constant for Cu in 
this system if the deposition rate is 200 A/min for 0.5-keV Ar ions. 
Assume a = 0.1 ion/cm, 7 = 0.08 electron/ion, d = 10 cm, and 
J 0 = 100 mA/cm 2 . 

b. What deposition rate can be expected for 1 -keV Ar if a = 0.15 
ion/cm and 7 = 0.1 electron/ion. 

11 . In a dc planar magnetron system operating at 1000 V, the anode-cathode 
spacing is 10 cm. What magnetic field should be applied to trap electrons 
within 1 cm of the target? 

12 . At what sputter deposition rate of In on a Si substrate will the film melt 
within 1 min? The melting point of In is 155 °C. 
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13- a. During magnetron sputtering of Au at 1 keV, suppose there are two 

collisions with Ar atoms prior to deposition. What is the energy of the 
depositing Au atoms? (Assume Ar is stationary in a collision.) 

b. The probability that gas-phase atoms will travel a distance x without 
collision is exp - x j\ where X is the mean-free path between 
collisions. Assume X for Au in Ar is 5 cm at a pressure of 1 mtorr. If 
the target-anode spacing is 12 cm, at what operating pressure will 
99% of the sputtered Au atoms undergo gas-phase collisions prior to 
deposition? 

14- For a new application it is desired to continuously coat a 1-m-wide steel 
strip with a 2-jim-thick coating of Al. The x-y dimensions of the steel are 
such that an array of electron-beam gun evaporators lies along the y 
direction and maintains a uniform coating thickness across the strip width. 
How fast should the steel be fed in the x direction past the surface 
sources, which can evaporate 20 g of Al per second? Assume that Eq. 
3-18 holds for the coating thickness along the x direction, that the 
source-strip distance is 30 cm, and that the steel sheet is essentially a 
horizontal substrate 40 cm long on either side of the source before it is 
coiled. 

15. Select the appropriate film deposition process (evaporation, sputtering, 
etc., sources, targets, etc.) for the following applications: 

a. Coating a large telescope mirror with Rh 

b. Web coating of potato chip bags with Al films 

c. Deposition of Al-Cu-Si thin-film interconnections for integrated cir¬ 
cuits 

d. Deposition of Ti0 2 -Si0 2 multilayers on artificial gems to enhance 
color and reflectivity 

16- Theory indicates that the kinetic energy ( E ) and angular spread of neutral 
atoms sputtered from a surface are given by the distribution function 

E 

F(E,6) = CS - rcos0, 

V ’ (E+U)> 

where U = binding energy of surface atoms 
C = constant 

$ = angle between sputtered atoms and the surface normal. 

a. Sketch the dependence of /(£, 0) vs. E for two values of U. 

b. Show that the maximum in the energy distribution occurs at E = U/2. 
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17 . a To better visualize the nucleation of clusters in the ICB process, 
schematically indicate the free energy of cluster formation vs. cluster 
size as a function of vapor supersaturation (see Section 1.7). 

b. What vapor supersaturation is required to create a 1000-atom cluster 
of Au if the surface tension is 1000 ergs/cm 2 ? 

c. If such a cluster is ionized and accelerated to an energy of 10 keV, 
how much energy is imparted to the substrate by each cluster atom? 

References 


1. W. R. Grove, Phil. Trans. Roy. Soc., London A 142, 87 (1852). 

2. M. Faraday, Phil. Trans. 147, 145 (1857). 

3. * R. Glang, in Handbook of Thin Film Technology , eds. L. I. Maissel 

and R. Glang, McGraw-Hill, New York (1970). 

4. * J. L. Vossen and J. J. Cuomo, in Thin Film Processes , eds. J. L. 

Vossen and W. Kern, Academic Press, New York (1978). 

5. * W. D. Westwood, in Microelectronic Materials and Processes , ed. 

R. A. Levy, Kluwer Academic, Dordrecht (1989). 

6 . * B. N. Chapman, Glow Discharge Processes, Wiley, New York (1980). 

7. C. H. P. Lupis, Chemical Thermodynamics of Materials , North-Hol- 
land, Amsterdam (1983). 

8 . R. E. Honig, RCA Rev. 23, 567 (1962). 

9. * H. K. Pulker, Coatings on Glass , Elsevier, New York, (1984). 

10. Examples taken from Physical Vapor Deposition , Airco-Temescal 
(1976). 

11. L. Holland, Vacuum Deposition of Thin Films , Wiley, New York 
(1956). 

12. C. H. Ting and A. R. Neureuther, Solid State Technol. 25(2), 115 
(1982). 

13. H. L. Caswell, in Physics of Thin Films , Vol. 1, ed. G. Hass, 
Academic Press, New York (1963). 

14. L. D. Hartsough and D. R. Denison, Solid State Technology 22(12), 
66 (1979). 

15. Handbook—The Optical Industry and Systems Directory , H-ll 
(1979). 

16. E. B. Grapper, J. Vac. Sci. Technol. 5A<4), 2718 (1987); 8, 333 
(1971). 


‘Recommended texts or reviews. 




References 


145 


17. P. Archibald and E. Parent, Solid State Technol. 19(7), 32 (1976). 

18. D. M. Mattox, J. Vac. Sci. Technol. A7(3), 1105 (1989). 

19. * A. B. Glaser and G. E. Subak-Sharpe, Integrated Circuit Engineering , 

Addison-Wesley, Reading, MA (1979). 

20. P. Sigmund, Phys. Rev . 184, 383 (1969). 

21. L. T. Lamont, Solid Stale Technol. 22(9), 107 (1979). 

22. J. A. Thornton, Thin Solid Films 54, 23 (1978). 

23. J. A. Thornton, in Thin Film Processes , eds. J. L. Vossen and W. 
Kern, Academic Press, New York (1978). 

24. H. R. Koenig and L. I. Maissel, IBM J. Res. Dev. 14, 168 (1970). 

25. * W. D. Westwood, in Physics of Thin Films , Vol. 14, eds. M. H. 

Francombe and J. L. Vossen, Academic Press, New York (1989). 

26. * L. I. Maissel and M. H. Francombe, An Introduction to Thin Films , 

Gordon and Breach, New York, (1973). 

27. L. I. Maissel and P. M. Schaible, J. Appl . Phys. 36, 237 (1965). 

28. J. L. Vossen and J. J. O’Neill, RCA Rev. 29, 566 (1968). 

29. D. M. Mattox, J. Vac. Sci. Technol. 10, 47 (1973). 

30. * R. F. Bunshah and C. Deshpandey, in Physics of Thin Films , Vol. 13, 

eds. M. H. Francombe and J. L. Vossen, Academic Press, New York 
(1987). 

31. J. M. E. Harper and J. J. Cuomo, J. Vac. Sci. Technol. 21(3), 737 (1982). 

32. J. M. E. Harper, J. J. Cuomo, R. J. Gambino, and H. R. Kaufman, in 
Ion Beam Modification of Surfaces , eds. O. Auciello and R. Kelly, 
Elsevier, Amsterdam (1984). 

33. * T. Takagi, in Physics of Thin Films , Vol. 13, eds. M. H. Francombe 

and J. L. Vossen, Academic Press, New York (1987). 

34. W. L. Brown, M. F. Jarrold, R. L. McEachem, M. Sosnowski, G. 
Takaoka, H. Usui and I. Yamada, Nuclear Instruments and Methods 
in Physics Research , to be published (1991). 





Chemical Vapor Deposition 


4 . 1 . Introduction 


Chemical vapor deposition (CVD) is the process of chemically reacting a 
volatile compound of a material to be deposited, with other gases, to produce a 
nonvolatile solid that deposits atomistically on a suitably placed substrate. 
High-temperature CVD processes for producing thin films and coatings have 
found increasing applications in such diverse technologies as the fabrication of 
solid-state electronic devices, the manufacture of ball bearings and cutting 
tools, and the production of rocket engine and nuclear reactor components. In 
particular, the need for high-quality epitaxial semiconductor films for both Si 
bipolar and MOS transistors, coupled with the necessity to deposit various 
insulating and passivating films at low temperatures, has served as a powerful 
impetus to spur development and implementation of CVD processing methods. 
A schematic view of the MOS field effect transistor structure in Fig. 4-1 
indicates the extent to which the technology is employed. Above the plane of 
the base P-Si wafer, all of the films with the exception of the gate oxide and 
A1 metallization are deposited by some variant of CVD processing. The films 
include polysilicon, dielectric Si0 2 , and SiN. 

Among the reasons for the growing adoption of CVD methods is the ability 
to produce a large variety of films and coatings of metals, semiconductors, and 
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DIELECTRIC SiN 



Figure 4-1. Schematic view of MOS field effect transistor cross section. 

compounds in either a crystalline or vitreous form, possessing high purity and 
desirable properties. Furthermore, the capability of controllably creating films 
of widely varying stoichiometry makes CVD unique among deposition tech¬ 
niques. Other advantages include relatively low cost of the equipment and 
operating expenses, suitability for both batch and semicontinuous operation, 
and compatibility with other processing steps. Hence, many variants of CVD 
processing have been researched and developed in recent years, including 
low-pressure (LPCVD), plasma-enhanced (PECVD), and laser-enhanced 
(LECVD) chemical vapor deposition. Hybrid processes combining features of 
both physical and chemical vapor deposition have also emerged. 

In this chapter, a number of topics related to the basic chemistry, physics, 
engineering, and materials science involved in CVD are explored. Practical 
concerns of chemical vapor transport, deposition processes, and equipment 
involved are discussed. The chapter is divided into the following sections: 

4.2. Reaction Types 

4.3. Thermodynamics of CVD 

4.4. Gas Transport 

4.5. Growth Kinetics 

4.6. CVD Processes and Systems 

Recommended review articles and books dealing with these aspects of CVD 
can be found in Refs. 1-7. 

To gain an appreciation of the scope of the subject, we first briefly 
categorize the various types of chemical reactions that have been employed to 
deposit films and coatings (Refs. 1-3). Corresponding examples are given for 
each by indicating the essential overall chemical equation and approximate 
reaction temperature. 



4.2. Reaction Types 


149 


4.2. Reaction Types 


4.2.1. Pyrolysis 

Pyrolysis involves the thermal decomposition of such gaseous species as 
hydrides, carbonyls, and organometallic compounds on hot substrates. Com¬ 
mercially important examples include the high-temperature pyrolysis of silane 
to produce polycrystalline or amorphous silicon films, and the low-temperature 
decomposition of nickel carbonyl to deposit nickel films. 


Si (J) + 2H 2(f) 

(650 °C), 

(4-1) 

NKOO)« g) - Ni (I) + 4CO (g) 

(180 °C). 

(4-2) 


Interestingly, the latter reaction is the basis of the Mond process, which has 
been employed for over a century in the metallurgical refining of Ni. 


4.2.2. Reduction 

These reactions commonly employ hydrogen gas as the reducing agent to effect 
the reduction of such gaseous species as halides, carbonyl halides, oxyhalides, 
or other oxygen-containing compounds. An important example is the reduction 
of SiCl 4 on single-crystal Si wafers to produce epitaxial Si films according to 
the reaction 


SiCl^g) + 2H 2(g) -» Si (i) + 4HCl (g) (1200 °C). (4-3) 

Refractory metal films such as W and Mo have been deposited by reducing the 
corresponding hexafluorides, e.g., 

WF 6(4) + 3H 2U) -W U) + 6HF (g) (300 °C), (4-4) 

MoF 6(s) + 3H 2(g) - Mo (J) + 6HF (g) (300 °C). (4-5) 

Tungsten films deposited at low temperatures have been actively investigated 
as a potential replacement for aluminum contacts and interconnections in 
integrated circuits. Interestingly, WF 6 gas reacts directly with exposed silicon 
surfaces, depositing thin W films while releasing the volatile SiF 4 by-product. 
In this way silicon contact holes can be selectively filled with tungsten while 
leaving neighboring insulator surfaces uncoated. 
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4.2.3. Oxidation 

Two examples of important oxidation reactions are 

SiH 4(s) + 0 2(f , - Si0 2(J) + 2H 2(g) (450 °C), (4-6) 

4PH 3U) + 50 2(g) - 2P 2 0 5(s) + 6H 2(g) (450 °C). (4-7) 

The deposition of Si0 2 by Eq. 4-6 is often carried out at a stage in the 
processing of integrated circuits where higher substrate temperatures cannot be 
tolerated. Frequently, about 7% phosphorous is simultaneously incorporated in 
the Si0 2 film by the reaction of Eq. 4-7 in order to produce a glass film that 
flows readily to produce a planar insulating surface, i.e., “planarization.” 

In another process of technological significance, Si0 2 is also produced by 
the oxidation reaction 

SiCl 4(g) + 2H 2(g) + 0 2(g) - Si0 2(g) + 4HCl (g) (1500 °C). (4-8) 

The eventual application here is the production of optical fiber for communica¬ 
tions purposes. Rather than a thin film, the Si0 2 forms a cotton-candy-like 
deposit consisting of soot particles less than 1000 A in size. These are then 
consolidated by elevated temperature sintering to produce a fully dense silica 
rod for subsequent drawing into fiber. Whether silica film deposition or soot 
formation occurs is governed by process variables favorable to heterogeneous 
or homogeneous nucleation, respectively. Homogeneous soot formation is 
essentially the result of a high SiCl 4 concentration in the gas phase. 

4.2.4. Compound Formation 

A variety of carbide, nitride, boride, etc., films and coatings can be readily 
produced by CVD techniques. What is required is that the compound elements 
exist in a volatile form and be sufficiently reactive in the gas phase. Examples 
of commercially important reactions include 

SiCl 4(a) + CH 4( „ - SiC (f) + 4HCl (g) (1400 -C), (4-9) 

TiCl 4(<) + CH 4(g) - TiC ( „ + 4HCl (g) (1000 °C), (4-10) 

BF 3(g) + NH 3U) - BN (i) + 3HF (g) (1100'C) (4-11) 

for the deposition of hard, wear-resistant surface coatings. Films and coatings 
of compounds can generally be produced through a variety of precursor gases 
and reactions. For example, in the much studied SiC system, layers were first 



4.2. Reaction Types 


151 


produced in 1909 through reaction of SiCl 4 + C 6 H 6 (Ref. 8). Subsequent 
reactant combinations over the years have included SiCl 4 + C 3 H 8 , SiBr 4 + 
C 2 H 4 , SiCl 4 + C 6 H 14 , SiHCl 3 + CC1 4 , and SiCl 4 + C 6 H 5 CH 3 , to name a 
few, as well as volatile organic compounds containing both silicon and carbon 
in the same molecule (e.g., CH 3 SiCl 3 , CH 3 SiH 3 , (CH 3 ) 2 SiCl 2 , etc.). Al¬ 
though the deposit is nominally SiC in all cases, resultant properties generally 
differ because of structural, compositional, and processing differences. 

Impermeable insulating and passivating films of Si 3 N 4 that are used in 
integrated circuits can be deposited at 750 °C by the reaction 

3SiCl 2 H 2U) + 4NH 3(g) - Si 3 N 4(s) + 6H 2(g) + 6HC \ (g) . (4-12) 

The necessity to deposit silicon nitride films at lower temperatures has led to 
alternative processing involving the use of plasmas. Films can be deposited 
below 300 °C with SiH 4 and NH 3 reactants, but considerable amounts of 
hydrogen are incorporated into the deposits. 

4.2.5. Disproportionation 

Disproportionation reactions are possible when a nonvolatile metal can form 
volatile compounds having different degrees of stability, depending on the 
temperature. This manifests itself in compounds, typically halides, where the 
metal exists in two valence states (e.g., Gel 4 and Gel 2 ) such that the 
lower-valent state is more stable at higher temperatures. As a result, the metal 
can be transported into the vapor phase by reacting it with its volatile, 
higher-valent halide to produce the more stable lower-valent halide. The latter 
disproportionates at lower temperatures to produce a deposit of metal while 
regenerating the higher-valent halide. This complex sequence can be simply 
described by the reversible reaction 

300 °C 

2Gel2 <*> iS Ge <*> + Gel4 <*> ( 4 - 13 ) 

and realized in systems where provision is made for mass transport between 
hot and cold ends. Elements that have lent themselves to this type of transport 
reaction include aluminum, boron, gallium, indium, silicon, titanium, zirco¬ 
nium, beryllium, and chromium. Single-crystal films of Si and Ge were grown 
by disproportionation reactions in the early days of CVD experimentation on 
semiconductors employing reactors such as that shown in Fig. 4-2. The 
enormous progress made in this area is revealed here. 
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TEMPERATURE (°C) 


Figure 4-2. Experimental reactor for epitaxial growth of Si films. (E. S. Wajda, B. 
W. Kippenhan, W. H. White, IBM J. Res. Dev. 7, 288, © i960 by International 
Business Machines Corporation, reprinted with permission). 


4.2.6. Reversible Transfer 


Chemical transfer or transport processes are characterized by a reversal in the 
reaction equilibrium at source and deposition regions maintained at different 
temperatures within a single reactor. An important example is the deposition of 
single-crystal (epitaxial) GaAs films by the chloride process according to the 
reaction 

750 *C 

As 4 (g) + As 2 (g> + 6GaCl (g) + 3H 2(g) ^=^6GaAs (5) + 6HCl (g) . (4-14) 

Here AsCl 3 gas from a bubbler transports Ga toward the substrates in the form 
of GaCl vapor. Subsequent reaction with As 4 causes deposition of GaAs. 
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Figure 4-3. Schematic of atmospheric CVD reactor used to grow GaAs and other 
compound semiconductor films by the hydride process. (Reprinted with permission 
from Ref. 10). 

Alternatively, in the hydride process, As is introduced in the form of AsH 3 
(arsine), and HC1 serves to transport Ga. Both processes essentially involve the 
same gas-phase reactions and are carried out in similar reactors, shown 
schematically in Fig. 4-3. What is significant is that single-crystal, binary 
(primarily GaAs and InP but also GaP and InAs) as well as ternary (e.g., 
(Ga, In)As and Ga(As, P)) compound films have been grown by these vapor 
phase epitaxy (VPE) processes. Similarly, in addition to binary and ternary 
semiconductor films, quaternary epitaxial films containing controlled amounts 
of Ga, In, As, and P have been deposited by the hydride VPE process. 
Combinations of gas mixtures and more complex reactors are required in this 
case to achieve the desired stoichiometries. The resulting films are the object 
of intense current research and development activity in a variety of optoelec¬ 
tronic devices (e.g., lasers and detectors). For quaternary alloy deposition by 
the hydride process, single-crystal InP substrates are employed. Gas-phase 
source reactions include 


2AsH 3 As 2 + 3H 2 , 
2PH 3 P 2 + 3H 2 , 
2HC1 + 2In ^ 2InCl + H 2 , 


2 * 


2HC1 + 2Ga 2GaCl + H 


(4-15) 
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Table 4-1. CVD Films and Coatings 





Deposition 


Deposited 



Temperature 


Material 

Substrate 

Input Reactants 

(°C) 

Crystallinity 

Si 

Single-crystal Si 

Either SiCl 2 H 2 , SiCl 3 H, 
or SiCl 4 -1- H 2 

1050-1200 

E 

Si 


SiH 4 + H 2 

600-700 

P 

Ge 

Single-crystal Ge 

GeCl 4 or GeH 4 + H 2 

600-900 

E 

SiC 

Single-crystal Si 

SiCl 4 , toluene, H 2 

1100 

P 

AIN 

Sapphire 

AICI3, nh 3 , h 2 

1000 

E 

ln 2 0 3 ; Sn 

Glass 

In-chelate, 

(C 4 H 9 ) 2 Sn(OOCH 3 ) 2 , H 2 0, 

o 2 ,n 2 

500 

A 

ZnS 

GaAs, GaP 

Zn, H 2 S, H 2 

825 

E 

CdS 

GaAs, sapphire 

Cd, H 2 S, H 2 

690 

E 

A1 2 0 3 

Si, 

A1(CH 3 ) 3 + 0 2 

275-475 

A 


cemented carbide 

AICI3, co 2 , h 2 

850-1100 

A 

Si0 2 

Si 

SiH 4 + 0 2 

450 

A 



SiCl 2 H 2 -1- 2N 2 0 

900 


Si 3 N 4 

Si0 2 

SiCl 2 H 2 + NH 3 

~ 750 

A 

SiNH 

Si0 2 

SiH 4 + NH 3 (plasma) 

300 

A 

Ti0 2 

Quartz 

Ti (OC 2 H 5 ) 4 + 0 2 

450 

A 

TiC 

Steel 

TiCl 4 ,CH 4 , H 2 

1000 

P 

TiN 

Steel 

TiCl 4 ,N 2 ,H 2 

1000 

P 

BN 

Steel 

BC1 3 ,NH 3 ,H 2 

1000 

P 

TiB 2 

Steel 

TiCl 4 ,BCl 3 ,H 2 

> 800 

P 


Note: E = epitaxial; P = polycrystalline; A = amorphous. 

Adapted from Refs. 1, 2, 3. 

Deposition reactions at substrates include 

2GaCl + As 2 + H 2 2GaAs + 2HC1, 

2GaCl + P 2 + H 2 2GaP + 2HC1, 

2InCl + P 2 + H 2 2InP 4- 2HC1, 

2InCl + As 2 + H 2 ** 2InAs + 2HC1. (4-16) 

Aspects of the properties of these deposited films will be discussed in Chap¬ 
ter 7. 

The previous examples are but a small sample of the total number of film 
and coating deposition reactions that have been researched in the laboratory as 
well as developed for commercial applications. Table 4-1 contains a brief list 
of CVD processes for depositing elemental and compound semiconductors and 
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assorted compounds. The entries are culled from various review articles (Refs. 
1-6) on the subject, and the interested reader is encouraged to consult these 
articles to obtain the primary references for specific details on process vari¬ 
ables. 

In carefully examining the aforementioned categories of CVD reactions, the 
discerning reader will note two common features: 

1. All of the chemical reactions can be written in the simplified generalized 
form 


°^(g) + C ^(s) + ^ D (S) > ( 4 ~ 17 ) 

where A, B,. . . refer to the chemical species and a, b,. . . to the corre¬ 
sponding stoichiometric coefficients. A single solid and mixture of gaseous 
species categorizes each heterogeneous reaction. 

2. Many of the reactions are reversible, and this suggests that standard 
concepts of chemical thermodynamics may prove fruitful in analyzing them. 

There is a distinction between chemical vapor deposition and chemical 
vapor transport reactions that should be noted. In the former, one or more 
gaseous species enter the reactor from gas tanks or liquid bubbler sources 
maintained outside the system. The reactants then combine at the hot substrate 
to produce the solid film. In chemical vapor transport reactions, solid or liquid 
sources are contained within closed or open reactors. In the latter case, 
externally introduced carrier or reactant gases flow over the sources and cause 
them to enter the vapor stream where they are transported along the reactor. 
Subsequently, deposition of solid from the gas phase occurs at the substrates. 
Both chemical vapor deposition and transport reactions are, however, de¬ 
scribed by the same type of chemical reaction. As far as thermodynamic 
analyses are concerned, no further distinction will be made between them, and 
the generic term CVD will be used for both. 


4.3. Thermodynamics of CVD 


4.3.1. Reaction Feasibility 

Thermodynamics addresses several important issues with respect to CVD. 
Whether a given chemical reaction is feasible is perhaps the most important of 
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these. Once it is decided that a reaction is possible, thermodynamic calculation 
can frequently provide information on the partial pressures of the involved 
gaseous species and the direction of transport in the case of reversible 
reactions. Importantly, it provides an upper limit of what to expect under 
specified conditions. Thermodynamics does not, however, consider questions 
related to the speed of the reaction and resulting film growth rates. Indeed, 
processes that are thermodynamically possible frequently proceed at such low 
rates, due to vapor transport kinetics and vapor-solid reaction limitations, as to 
be unfeasible in practice. Furthermore, the use of thermodynamics implies that 
chemical equilibrium has been attained. Although this may occur in a closed 
system, it is generally not the case in an open or flow reactor where gaseous 
reactants and products are continuously introduced and removed. Thus, CVD 
may be presently viewed as an empirical science with thermodynamic guide¬ 
lines. 

Provided that the free-energy change AG can be approximated by the 
standard free-energy change AC 0 , many simple consequences of thermody¬ 
namics with respect to CVD can be understood. For example, consider the 
requirements for suitable chemical reactions in order to grow single-crystal 
films. In this case, it is essential that a single nucleus form as an oriented seed 
for subsequent growth. According to elementary nucleation theory, a small 
negative value of A G v , the chemical free energy per unit volume, is required 
to foster a low nucleation rate of large critical-sized nuclei (see Section 1.7). 
This, in turn, would require a AC° value close to zero. When this happens, 
large amounts of reactants and products are simultaneously present. If A G v 
were large, the likelihood of a high rate of heterogeneous nucleation, or even 
homogeneous nucleation of solid particles within the gas phase, would be 
enhanced. The large driving force for chemical reaction tends to promote 
polycrystal formation in this case. 

As an example, we follow the thought process involved in the design of a 
CVD reaction to grow crystalline Y 2 0 3 films. Following the treatment by 
Laudise (Ref. 11), consider the reaction 

2YCl 3(g) + (3/2)0*,, * Y 2 0 3(s) + 3C1 2( „ . (4-18) 

At 1000 K, A G° = -59.4 kcal/mole, corresponding to log K - +13. The 
reaction is thus too far to the right for practical film growth. If the chloride is 
replaced by a bromide or an iodide, the situation is worse. YBr 3 and YI 3 are 
expected to be less stable than YC1 3 , making AG° even more negative. The 
situation is improved by adding a gas-phase reaction with a positive value of 
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AG°: e.g., 

CO^, <=> CO (s) + (l/2)0 2((f> ; AG° = +46.7 kcal/mole. (4-19) 
Thus, the possible overall reaction is now 

2YC1 3(j?) + 3C0 2(g) * Y 2 0 3(j) + 3CO (g) + 3Cl 2(g) , (4-20) 

and AG° = -59.4 + 3(46.7) = +80.7 kcal/mole. The equilibrium now falls 
too far to the left, but by substituting YBr 3 and Br 2 for YC1 3 and Cl 2 , we 
change the sign of AG° once again. Thus, for 

2YBr Mg) + 3C0 2(g) ^ Y 2 0 3(J) + 3CO u) + 3Br 2U) , (4-21) 

AG° = -27 kcal/mole. 

Although a value of AG° closer to zero would be more desirable, this 
reaction yields partial pressures of YBr 3 equal to 1CT 2 atm when the total 
pressure is 2 atm. Growth in other systems has occurred at such pressures. 
Pending availability of YBr 3 in readily volatile form and questions related to 
the operating temperature and safety of handling reactants and products, Eq. 
4-21 appears to be a potential candidate for successful film growth. For 
analysis of chemical reactions, good values of thermodynamic data are essen¬ 
tial. Several sources of this information are listed among the references (Refs. 
12, 13). 

4.3.2. Conditions of Equilibrium 

Thermodynamics can provide us with much more than a prediciton of whether 
a reaction will proceed. Under certain circumstances, it can yield quantitative 
information on the operating intensive variables that characterize the equilib¬ 
rium. The problem is to evaluate the partial pressures or concentrations of the 
involved species within the reactor, given the reactant compositions and 
operating temperature. In practice, the calculation is frequently more compli¬ 
cated than initially envisioned, because in situ mass spectroscopic analysis of 
operating reactors has, surprisingly, revealed the presence of unexpected 
species that must be accounted for. For example, in the technologically 
important deposition of Si films, at least eight gaseous compounds have been 
identified during the reduction of chlorosilanes. The Si-Cl-H system has been 
much studied, and the following example illustrates the method of calculation 
(Refs. 14, 15). The most abundant chemical species in this system are SiCl 4 , 
SiCl 3 H, SiCl 2 H 2 , SiClH 3 , SiH 4 , SiCl 2 , HC1, and H 2 . These eight gaseous 
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species are connected by the following six 

equations of chemical equilibrium: 

SiCl 4( £) 4 2H 2(g) +*Si (s) 4- 4HC1 (£) ; 

A < ” p p2 > 

r ^SiCl 4 r ^H 2 

(4-22a) 

SiCl 3 H U) 4 H 2(g) <^Si (i) 4 3HCl (g) ; 

^SiCl3H^H 2 

(4-22b) 

SiCl 2 H 2(g) ^Si (:r) 4 2HCl (g) ; 

^ ( fl Si)^HCl 

A 3 - p 

SiCl 2 H 2 

(4-22c) 

SiClH 3(g) 4±Si (jr) 4 HCl (g) 4 H 2(g) ; 

v ( a Si) ^HCI-^Hz 

A 4 " p 

'“SiClH 3 

(4-22d) 

SiCl 2(5 ) 4 H 2(g) «=*Si (J) 4 2HCl (g) ; 

^ ( a Si) ^HCl 

A c — , 

5 p p 

■* SiCl 2 *H 2 

(4-22e) 

SiH 4 ( g )^Si (5) 4 2H 2(g) ; 

K _ ( a Si) P H 2 
^SiH 4 

(4-22f) 

The activity of solid Si, a Sl , is taken to be unity. 



To solve for the eight unknown partial pressures, we need two more 
equations relating these quantities. The first specifies that the total pressure in 
the reactor, which is equal to the sum of the individual partial pressures, is 
fixed, say at 1 atm. Therefore, 

^SiCl 4 + ^SiCI 3 H + Ps\C\ 2 H 2 + ^SiCIH 3 + ^SiH 4 + ^SiC] 2 + ^HCI + ^H 2 = 1 ■ 

(4-23) 

The final equation involves the Cl/H molar ratio, which may be taken to be 
fixed if neither Cl or H atoms are effectively added or removed from the 
system. Therefore, 

/ Cl \ 4P SiCl4 4- 3P SiCl3H 4 2P SiC , 2H2 + 2P SiCl2 4 P SiaH3 4- P HC , 

\ H / 2P H2 4 P S ici 3 H + 2P SiCl2H2 4- 3P SiC i H3 4 P HC1 4 4P Sih , 4 

(4-24) 

The numerator represents the total amount of Cl in the system and is equal to 
the sum of the Cl contributed by each specie. For example, the mass of Cl in 
SiCl 4 is given by m cl = 4A/ cl (/w SiC , 4 /A/ SiC , 4 ), where m and M refer to the 
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mass and molecular weight, respectively. But by the perfect gas law, 


m s\c\ 4 

M$iC \ 4 


^SiCl 4 ^ 

RT ’ 


and, therefore. 


number of moles of Cl = 


m 


ci 


M, 


ci 


4^sici V 
RT 


Similarly, for all other terms in the numerator and denominator. 

The common factor V / RT, involving the volume V and the temperature T 
of the reactor, cancels, and all that is left is given by Eq. 4-24. There are now 
eight independent equations relating the eight unknown partial pressures, 
which can be determined, at least in principle. First, however, the individual 
equilibrium constants K t must be specified, and this requires a slight excur¬ 
sion requiring additional thermodynamic calculation. The value of K t is fixed 
by specifying T and AG°. A convenient summary of thermodynamic data in 
the Si-Cl-H system is given in Fig. 4-4, where the free energies of compound 
formation are plotted versus temperature in an Ellingham-type diagram. Each 
line represents the equation A G° = A H° - T A S°, from which A//°, A5°, 
and A G° can be calculated for the compound in question at any temperature. 
For example, consider the formation reactions for SiCl 4 and HC1 at 1500 K. 



°K 

Figure 4-4. Free energies of formation of important gaseous species in the Si-Cl-H 
system in the temperature range 800-1600 K. (Reprinted by permission of the 
publisher. The Electrochemical Society, Inc. from Ref. 14). 
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TEMPERATURE (K) 

Figure 4-5. Composition of gas phase versus reactor temperature. Total pressure = 1 
atm, Cl/H = 0.01. (E. Sirtl, L. P, Hunt and D. H. Sawyer, J. Electrochem . Soc . 
121, 919(1974). 

From Fig. 4-4, 

Si + 2Cl 2 ^SiCl 4 , A G° = - 106 kcal/mole, 

(l/2)H 2 + (l/2)Cl 2 ?± HCl, AG° = -25 kcal/mole, 

SiCl 4 + 2H 2 * ± Si + 4HC1, AC° = +106 + 4(-25) = +6 kcal/mole. 

Therefore K ] = exp - 6000/(1.99)1500 = 0.13, and similarly for other val- 
ues of K. 

The results of the calculation are shown in Fig. 4-5 for a molar ratio of 
[Cl/H] = 0.01, which is typical of conditions used for epitaxial deposition of 
Si. Through application of an equation similar to 4-24, the molar ratio of 
[Si/Cl] was obtained and is schematically plotted in the same figure. A reactor 
operating temperature in the vicinity of 1400 K is suggested because the Si 
content in the gas phase is then minimized. Such temperatures are employed in 
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practice. Analogous calculations have also been made for the case where 
[Cl/H] = 0.1, which is typical of conditions favoring deposition of polycrys¬ 
talline Si. At equivalent temperatures, the [Si/Cl] ratios are somewhat higher 
than obtained for epitaxial deposition, reflecting the greater Si gas concentra¬ 
tion operative during poly crystal growth. In both cases, hydrogen is by far the 
most abundant species in the gas phase. 

Similar multispecies thermodynamic analyses involving simultaneous nonlin¬ 
ear equations have been made in a variety of semiconductor, oxide, nitride, 
and carbide systems. As a second example, consider the deposition of silicon 
carbide utilizing two independent source gases CH 4 and SiCl 4 (Ref. 16). 
These react according to Eq. 4-9. There are now four atomic species (C, Cl, 
H, and Si) rather than three for the deposition of Si. The Gibbs phase rule (Eq. 
1-18) can be useful in analyzing this situation. Since n = 4 and \p = 2 (solid 
SiC and gas), / = 4. This suggests that, in addition to fixing the temperature 
and pressure of the reactor, it is now necessary to specify two input molar 
ratios (i.e., [H/Cl], [Si/C]) to uniquely determine the system. The results of a 
thermodynamic analysis shown in Fig. 4-6 predicts that stoichiometric SiC will 
form for only those combinations of H /Cl and Si /C that fall on the darkened 
line. In actuality, SiC can be synthesized over a fairly broad range of input gas 
ratios at 1400 °C, as seen in the experimentally determined “phase diagram/’ 



H/CI RATIO 

Figure 4-6. Map of decomposition products for various [Si/C] and [H/Cl] ratios at 
1400 °C. (Reprinted with permission from JOM, (formerly J. of Metals) 28(6), 6 (1976), a 
publication of the Minerals, Metals & Materials Society). 
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This behavior is indicative of complex thermodynamic and reaction kinetics 
factors that are not easily accounted for. 


4.4. Gas Transport 


Gas transport is the process by which volatile species flow from one part of a 
reactor to another. It is important to understand the nature of gas transport 
phenomena in CVD systems for the following reasons: 

1. The deposited film or coating thickness uniformity depends on the delivery 
of equal amounts of reactants to all substrate surfaces. 

2. High deposition growth rates depend on optimizing the flow of reactants 
through the system and to substrates. 

3. More efficient utilization of process gases can be achieved as a result. 

4. The computer modeling of CVD processes can be facilitated, enabling 
improved reactor design and better predictive capability with regard to 
performance. 

At the outset, it is important to distinguish between diffusion and bulk flow 
processes in gases. Diffusion involves the motion of individual atomic or 
molecular species, whereas in bulk transport processes, such as viscous flow or 
convection, parts of the gas move as a whole. Different driving forces and 
resulting transport equations define and characterize these two broad types of 
gas flow. Each of these will now be discussed briefly as a prelude to 
considering the combinations of flow that take place in actual CVD reactors. 


4.4.1. Viscous Flow 

The viscous flow regime is operative when gas transport occurs at pressures of 
roughly 0.01 atm and above in reactors of typical size. This is also the pressure 
range characteristic of CVD systems. At typical flow velocities of tens of 
cm/sec, the reactant gases exhibit what is known as laminar or streamline 
flow. The theory of fluid mechanics provides a picture of what occurs under 
such circumstances. We shall consider the simplest of flow problems—that 
parallel to a flat plate. 

As shown in Fig. 4-7, the flow velocity has a uniform value u 0 , but only 
prior to impinging on the leading edge of the plate. However, as flow 
progresses, velocity gradients must form because the gas clings to the plate. 
Far away, the velocity is still uniform, but drops rapidly to zero at the plate 
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Figure 4-7. Laminar gas flow patterns: (top) flow across flat plate; (bottom) flow 
through circular pipe. 


surface, creating a boundary layer. The latter grows with distance along the 
plate and has a thickness d(x) given by 6 ( x) = 5x / >/Re*, where Re x is the 
Reynolds number, defined as Re^ = v 0 px/rj. The quantities 7 j and p are the 
gas viscosity and density, respectively. More will be said about 77 , but note that 
the viscosity essentially establishes the frictional viscous forces that decelerate 
the gas at the plate surface. 

The average boundary-layer thickness over the whole plate is 


_ 1 fL 10 LvV 10 L 

5 = 7/ &{x) dx = — / ■ = ~z / > 

LJ 0 3 y v 0 pL 3 yRe L 


(4-25) 


where Re L is defined as Re L = pv 0 L/r}. Because both gaseous reactants and 
products must pass through the boundary layer separating the laminar stream 
and film deposit, low values of 5 are desirable in enhancing mass-transport 
rates. This can be practically achieved by increasing the gas flow rate (u 0 ), 
which raises the value of Re. Typical values of Re in CVD reactors range up to 
a few hundred. If, however, Re exceeds approximately 2100, a transition from 
laminar to turbulent flow occurs. The resulting erratic gas eddies and swirls 
are not conducive to uniform defect-free film growth, and are to be avoided. 

It is instructive to now consider gas flow through a tube of circular cross 
section. The initial uniform axial flow velocity is altered after the gas enters 
the tube. Boundary layers develop at the walls and grow with distance along 



164 


Chemical Vapor Deposition 


the tube, as shown in Fig. 4-7. The Reynolds number is now given by 
— 2 pv 0 r 0 /ri, where r 0 is the tube radius. Beyond a certain critical entry 
length L e « 0.07r o Re, the flow is fully developed and the velocity profile no 
longer changes. At this point, the boundary layers around the tube circumfer¬ 
ence have merged, and the whole cross section consists of “boundary layer/’ 
The axial flow is now described by the Hagen-Poiseuille relation 

7T/*o A P 

V =-cm 3 /sec, (4-26) 

where V is the volumetric flow rate and AP/Ax is the pressure gradient 
driving force for viscous flow. The volumetric flow rate is defined as the 
volume of gas that moves per unit time through the cross section and is related 
to the average gas velocity u by V = irr^v. Within the tube, the gas velocity 
v(r) assumes a parabolic profile as a function of the radial distance r from the 
center, given by v(r) = u max (l - r 2 /^), where v max is the maximum gas 
velocity. The gas flux J is given by the product of the concentration of the 
species in question and the velocity with which it moves: 

J, = C-v,. (4-27) 

Upon substitution of C, = P ; / RT from the perfect gas law, and ^ = V/ tt r q , 
we have 

j = 

' RT Ax ‘ 

Provided the molar flux of any gaseous species in a chemical reaction is 
known, the fluxes of other species can be determined from the stoichiometric 
coefficients if equilibrium conditions prevail. 

Viscous flow is characterized by the coefficient of viscosity tj. The kinetic 
theory of gases predicts that 7/ varies with temperature as 7" 1/2 but is 
independent of pressure. Experimental data bear out the lack of a pressure 
dependance at least to several atmospheres, but indicate that q varies as T n , 
with n having values between 0.6 and 1.0. Gas viscosities typically range 
between 0.01 centipoise (cP) at 0 °C and 0.1 cP at 1000 °C (1 poise = 1 
dyne-sec cm -2 .) 

4.4.2. Diffusion 

The phenomenon of diffusion occurs in gases and liquids as well as in solids. If 
two different gases are initially separated and then allowed to mix, each will 
move from regions of higher to lower concentration, thus increasing the 
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entropy of the system. The process by which this occurs is known as diffusion 
and is characterized by Fick’s law (Section 1.6). Elementary kinetic theory of 
gases predicts that the diffusivity D depends on pressure and temperature as 
D ~ T 3/2 /P. It is therefore usual to represent D in gases by 

D = D °y(rJ- (4 - 29) 


where n is experimentally found to be approximately 1.8. The quantity D 0 , 
the value of D measured at standard temperature T 0 (273 K) and pressure P 0 
(1 atm), depends on the gas combination in question. Typical D 0 values at 
temperatures of interest span the range 0.1-10 cm 2 /sec and are many orders 
of magnitude higher than even the largest values for diffusivity in solids. If the 
gas composition is reasonably dilute so that the perfect gas law applies, 
C = P / RT and Eq. 1-21 can be equivalently expressed by 




D dPj 
~RT ~dx 


(4-30) 


This formula can be applied to the diffusion of gas through the stagnant 
boundary layer of thickness 5 adjacent to the substrate. The flux is then given 
by 


Dj P.-Pio) 

8RT 


(4-31) 


Here P i is the vapor pressure in the bulk gas and P i0 is the vapor pressure at 
the surface. 

Since D varies inversely with pressure, gas mass-transfer rates can be 
enhanced by reducing the pressure in the reactor. Advantage of this fact is 
taken in low-pressure CVD (LPCVD) systems, which are now extensively 
employed in semiconductor processing. Their operation will be discussed in 
Section 4.6.3. 

As an example that integrates both thermodynamics and diffusion in a CVD 
process, consider the deposition of CdTe films by close-spaced vapor transport 
(CSVT) (Ref. 17). In this process, mass is transferred from a solid CdTe 
source at temperature T, located a very short distance / (typically 1 mm) from 
the substrate maintained at T 2 (T x > T 2 ). Our objective is to establish 
conditions necessary to derive an expression for the film growth rate. We 
assume that chemical equilibrium prevails at the respective temperatures. The 
basic reaction is 

CdTe (J) * Cd (g) + 1 /2Te 2(g) , (4-32) 
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for which AG = 68.64 - 44.94 X 10 3 T kcal/mole. Therefore, the equa¬ 
tions 

AG(r.) 

PMP^iT,) = exp - = K(T ,), (4-33a) 

P C 6(T 2 )Pil 2 2 (T 2 ) = exp - = K{T 2 ) (4-33b) 

express the equilibria at source and substrate. If the concentrations of the 
gas-phase species vary linearly with distance, the individual mass fluxes (in 


units of moles /cm 2 -sec) are expressed by 



/ ^ Cd 1 
J cd — i | 

RT t 

PM) \ 
rt 2 )’ 

(4-34a) 

D Tc j 

T Ic 2 1 

TCl ~ l 1 

(PtcW 
[ RT ] 

^ > Tc 2 ( ^ 2 ) \ 

rt 2 )■ 

(4-34b) 


Note the use of the perfect gas law and the neglect of the temperature 
dependence of D. Maintenance of stoichiometry requires that 

J Ci = 2J Tei , (4-35) 

the factor of 2 arising because Te 2 is a dimer. The film growth rate is obtained 
from the relation 


G( /xm/min) = 


^Cd^CdTe(60 X 10 4 ) 


(4-36) 


where M and p are the molar mass and density of CdTe, respectively. If T l 
exceeds T 2 by approximately 100 °C, then Pi(T x ) > Pj(T 2 ), where / refers to 
both Cd and Te 2 . By neglecting the T 2 terms in Eqs. 4-34a and b, we write 


PJJ\) = 2^Te, 

^Te^^l) ^Cd 


(4-37) 


The value of 1.1 is derived from kinetic theory of gases, which suggests that 
D Cd = 1.85D Te2 in H 2 , He, or Ar ambients. Equations 4-33, 4-34, 4-35, and 
4-37 enable all the partial pressures to be determined. By knowing the value of 
D Cd or D Tc2 , G can be evaluated. 

Since its inception in 1963, CSVT has been used to grow a wide variety of 
semiconductor films for experimental purposes, including CdS, CdSe, GaP, 
GaAs, GaAs^-P^^, Hg t .^.Cd^Te, InP, ZnS, ZnSe, and ZnTe. 
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4.4.3. Convection 

Convection is a bulk gas flow process that can be distinguished from both 
diffusion and viscous flow. Whereas gas diffusion involves the statistical 
motion of atoms and molecules driven by concentration gradients, convection 
arises from the response to gravitational, centrifugal, electric, and magnetic 
forces. It is manifested in CVD reactors when there are vertical gas density or 
temperature gradients. An important example occurs in cold-wall reactors, 
such as depicted in Figure 4-13, where heated susceptors are surrounded above 
as well as on the sides by the cooler walls. Cooler, denser gases then lie above 
hotter, less dense gases. The resultant convective instability causes an over¬ 
turning of the gas by bouyancy effects. Subsequently, a complex coupling of 
mass and heat transfer serves to reduce both density and temperature gradients 
in the system. Another example of convective flow occurs in two-temperature- 
zone, vertical reactors. In the disproportionation process considered previ¬ 
ously, it is immaterial whether the hotter zone is physically located above or 
below the cooler zone insofar as thermodynamics is concerned. But efficient 
gas flow considerations mandate the placement of the cooler region on top to 
enhance gas circulation as shown in Fig. 4-2. 

Note that film growth is limited by viscous, diffusive, and convective mass 
transport fluxes, which, in turn, are driven by gas pressure gradients. In open 
reactors the metered gas (volumetric) flow rates establish these pressure 
gradients. In closed reactors the latter arise because of imposed temperature 
differences that locally alter the equilibrium partial pressures. 


4.5. Growth Kinetics 


The growth kinetics of CVD films depends on several factors associated with 
the gas-substrate interface, including 

a. Transport of reactants through the boundary layer to the substrate 

b. Adsorption of reactants at the substrate 

c. Atomic and molecular surface diffusion, chemical reactions, and incorpora¬ 
tion into the lattice 

d. Transport of products away from the substrate through the boundary layer 

The intimate microscopic details of these steps are usually unknown, and, 
therefore, the growth kinetics are frequently modeled in macroscopic terms. 
This is a simpler approach since it makes no atomistic assumptions but yet is 
capable of predicting deposit growth rates and uniformity within reactors. 
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4.5.1. Growth Rate Uniformity 

In what follows, a relatively simple approach to the analysis of epitaxial 
growth of Si in a horizontal reactor is considered. In particular, we are 
interested in knowing how uniform the deposit will be as a function of distance 
along the reactor. Although the specific reaction considered is the hydrogen 
reduction of chlorosilane, the results can be broadly applied to other CVD 
processes as well. 

In this treatment (Ref. 18), the reactor configuration is shown in Fig. 4-8a 
and the following is assumed: 

1. The gas has a constant velocity component along the axis tube. 

2. The whole system is at constant temperature. 

3. The reactor extends a large distance in the z direction so that the problem 
reduces to one of two dimensions. 


y = b 




(b) 

Figure 4-8. (a) Horizontal reactor geometry, (b) Variation of growth rate with 
position along susceptor. Reactor conditions: v = 7.5 cm/sec, b - 1.4 cm, T ~ 1200 
°C, and C, = 3.1 x 10“ 5 g/cm 3 . (Reprinted from P. C. Rundle, Int’l J. Electronics 
24, 405, © 1968 Taylor and Francis, Ltd.). 
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The flow is simply treated by assuming the mass flux J vector at any point to 
be composed of two terms: 

J = C(x y y)v - £> VC(x, jy). (4-38) 

The first term represents a bulk viscous (plug) flow where the source of 
concentration C(x , y) moves as a whole with drift velocity v. The second 
term is due to diffusion of individual gas molecules, with diffusivity D y along 
concentration gradients. Since two-dimensional diffusion is involved, both x 
and y components of VC must be considered. 

By taking the difference of the mass flux into and out of an elemental 
volume (as was done in Chapter 1) and equating it to the mass accumulation, 
we obtain 


3C(x,f) /d 2 C(x,.y) d 2 C(x,.y)\ 

dt " ( JP + dp / 


dC{x,y) 

v - 

dx 


(4-39) 


Only the steady-state solutions are of concern, so dC(x , t)/di = 0. The 
resulting equation is subject to three conditions: 


C = 0 

when y = 0, x > 0, 

(4-40a) 

dC 


(4-40b) 

— =0 
dy 

when y = b , x > 0, 

c = Cj 

atx = 0, Z? > .y > 0. 

(4-40c) 


The first condition assumes that the chemical reaction is complete at the 
substrate surface y = 0 and, therefore, the concentration of the Si containing 
source gas is zero there. The second condition implies that there is no net 
diffusive mass flux at the top of the reactor. Gas molecules impinging at 
surface y - b are merely reflected back into the system. The final boundary 
condition states that the input source gas concentration is C,, a constant. 

Equations 4-39 and 4-40 specify a boundary value problem, and the well- 
known techniques of partial differential equations involving separation of 
variables or Laplace transform methods gives as the solution 


4C\ _ 

c{x,y) = — £ 


l 


7T n = 0 2/1 + 1 


sin 


iry 

(2» + .)^ 


D 


xex P t ; - Vis + ( 2 « + 0 u 


D 


x. 


(4-41) 


To obtain a more readily usable form of the solution, we assume as a first 
approximation that vb > Dir. Except for short distances into the reactor or 
small values of x, only the first term in the series need be retained. These 
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simplications give 

4C. / 7T_V \ tt 2 Dx 

C( X ,,) = —S i„(—)exp- w . (442) 

The flux of source gas to the substrate surface is given by 

dC(x,y) 

J(x ) = —D - g/cm 2 sec. (4-43) 

d y y =0 

The resultant deposit growth rate G(jc) is related to J(x) through simple 
material constants by 

M si 

G(x) = -/(a*) cm/sec (4-44) 

pM s 


where M si and M s are the molecular weights of the Si and source gas, 
respectively, and p is the density of Si. Combining Eqs. (4-42)-(4-44) yields 


<?(*) 


2 CjMsi 

b P M s 


Bexp - 


tt 2 Dx 
4U6 2 ' 


(445) 


The values of D , v , and C, are strictly those pertaining to the mean 
temperature of the reactor, T. An exponential decay in the Si growth rate with 
distance along the reactor is predicted. This is not too surprising, since the 
input gases are progressively depleted of reactants. The implicit boundary 
condition requiring that C = 0 at x * oo accounts for this loss. Despite the 
extreme simplicity of the assumptions, the model provides rather good agree¬ 
ment with experimental data on the variation of Si growth rate with distance, 
as indicated in Fig. 4-8b. 

From the standpoint of reactor performance, high growth rates and uniform 
deposition are the two most important concerns, assuming film quality is not 
compromised. The equation for G(jc) provides design guidelines, but they are 
not always simple to implement. Tilting the susceptor as shown in Fig. 4-13 is 
an effective way to improve growth uniformity. The increase in gas flow 
velocity in the tapered space above susceptors within horizontal or barrel 
reactors serves to decrease 6. Enhanced transport across the stagnant layer then 
compensates for reactant depletion. Another simple remedy is to continuously 
increase the temperature downstream within the reactor. 


4.5.2. Temperature Dependence 

It is instructive to reproduce the treatment of kinetics of film growth by Grove 
(Ref. 19), to understand the effect of temperature. The essentials of this simple 
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model are shown in Fig. 4-9, where the environment in the vicinity of the 
gas-growing film interface is shown. A drop in concentration of the reactant 
from C g in the bulk of the gas to C s at the interface occurs. The corresponding 
mass flux is given by 

J g s = h g {C g -C s ), (4-46) 

where h g is the gas-phase mass-transfer coefficient, to be defined later. The 
flux consumed by the reaction taking place at the surface of the growing film is 
approximated by 

J s = k s C s , (4-47) 

where first-order kinetics are assumed, and k s is the rate constant for surface 
reaction. In the steady state, J gs = J s , so 

Cj= l + k,/h t - 

This formula predicts that the surface concentration drops to zero if k s > h g , 
a condition referred to as mass-transfer control. In this case, low gas transport 
through the boundary layer limits the otherwise rapid surface reaction. 

Conversely, surface reaction control dominates where h g > k s , in which 
case C s approaches C g . Here the surface reaction is sluggish even though 
sufficient reactant gas is available. The film growth rate G is given by 
G ~ J s /N 0 , where N 0 is the atomic density or number of atoms incorporated 




Figure 4-9. Model of growth process. Gas flows normal to plane of paper. (Re¬ 
printed with permission from John Wiley and Sons, from A. S. Grove, Physics and 
Technology of Semiconductor Devices, Copyright © 1967, John Wiley and Sons). 
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Figure 4-10. Deposition rate of Si from four different precursor gases as a function 
of temperature. (From W. Kern in Microelectronic Materials & Processes , ed. by 
R. A. Levy, reprinted by permission of Kluwer Academic Publishers, 1989). 


into the film per unit volume. Therefore, 

q _ k s h g C g 
k s + h g N 0 

The temperature dependence of G hinges on the properties of k s and h g . A 
Boltzmann factor behavior dominates the temperature dependence of k s \ i.e., 
k s ~ exp — E/RT , where E is the characteristic activation energy involved. 
Comparison of Eqs. 4-31 and 4-46 reveals that h g is related to D/b. Since D g 
varies as T 2 at most and b is weakly dependent on T, h g is relatively 
insensitive to variations in temperature. At low temperatures, film growth is 
surface-reaction-controlled; i.e., G = k s C g /N 0 . At high temperatures, how¬ 
ever, the mass transfer or diffusion-controlled regime is accessed where 
G = h g C g /N 0 . The predicted behavior is borne out by growth rate data for 
epitaxial Si, as shown in the Arrhenius plots of Fig. 4-10. Actual film growth 
processes are carried out in the gas diffusion-controlled region, where the 
temperature response is relatively flat. At lower temperatures the same activa¬ 
tion energy of about 1.5 eV is obtained irrespective of the chlorosilane used. 
Migration of Si adatoms is interpreted to be the rate-limiting step in this 
temperature regime. 


(4-49) 
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Table 4-2. Influence of Process Conditions on Kinetics of Si Deposition 
From SiHCl 3 -H 2 andSiCl 4 -H 2 Mixtures 


Variable 

Diffusion 

Controlled 


Reaction 

Controlled 

1. Linear flow rate 

Low (la) 

Medium 

High (lb) 

2. Mole fraction chlorosilane 

Low (2a) 

Medium 

High (2b) 

3. Substrate temperature 

High (3a) 

Medium 

Low (3b) 

4. Temperature gradient (near surface) 

Low 

Medium 

High 

5. Surface site density 

High 

Medium 

Low 

6. Silicon surface per reaction volume 

High 

Medium 

Low 


la 

< 0.3 cm/sec 

lb. 

> 3 cm/sec 

2a 

< 0.01 

2b. 

> 0.1 

3a 

> 1550 K 

3b 

< 1300 K 

From Ref. 15. 




Regardless of the different types of equipment and deposition conditions 
employed, general outlines can be given as to what extent either diffusion-con¬ 
trolled or surface-controlled processes dominate. In Table 4-2 the influence of 
the most important variables is shown. Depending on whether the balance of 
conditions lies to the left or to the right of center, a reasonable prediction of 
operative kinetic mechanism can be made. 

4.5.3. Thermodynamic Considerations 

The previous discussion implies that ail reaction rates increase with tempera¬ 
ture. Though generally true, it is sometimes observed that higher reactor 
temperatures lead to lower film growth rates in certain systems. This apparent 
paradox can be explained by considering the reversibility of chemical reac¬ 
tions. In Chapter 1 the net rate for a forward exothermic reaction (and reverse 
endothermic reaction) was given by Eq. 1-36 and modeled in Fig. 1-18. Recall 
that exothermic reactions mean that the sign of A H° is negative; the reactants 
have more energy than the products. For endothermic reactions, A H° is 
positive. The individual forward and reverse reaction components are now 
shown in Fig. 4-1 la on a common plot. Clearly, the activation energy barrier 
(or slope) for the reverse reaction exceeds that for the forward reaction. The 
net reaction rate or difference between the individual rates is also indicated. 
Interestingly, it reaches a maximum and then drops with temperature. A 
practical manifestation of this is etching—the reverse of deposition—in the 
high-temperature range. 

In Fig. 4-1 lb the alternate case is considered—a forward endothermic 
deposition reaction and a reverse exothermic reaction. Here, the net reaction 
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rate increases monotonically with temperature, and film growth rates will 
always increase with temperature. It is left for the reader to show that A H° 
for the reduction of SiCl 4 by H 2 (Eq. 4-3) is endothermic as written; e.g., 
AH a = +60 kcal/mole. There is actually a thermodynamic driving force that 
tends to transport Si from the cooler regions (i.e., walls) to the hottest part of 
the reactor. This is where inductively heated substrates are placed and where 
film growth rates are highest. Epitaxial Si is most efficiently deposited in 
cold-wall reactors for this reason. 

The opposite is true when A H* <0. Reversible reactions (Eq. 4-14) for the 
deposition of GaAs are exothermic, and hot-wall reactors are employed in this 
case to prevent deposition on the walls. 

These results are a direct consequence of the well-known van’t Hoff 
equation 

dlnK^ A H° 
dT ~ RT 2 ’ 

where K ^ is the reaction equilibrium constant. 

4.5.4. Structure 

The actual film and coating structural morphologies that develop during CVD 
are the result of a complex sequence of atomic migration events on substrates 
leading to observable nucleation and growth processes (Ref. 21). Since the 
kinetic details are similar in all film formation and growth processes irrespec¬ 
tive of whether deposition is chemical or physical in nature, they will be 
treated within a common framework in Chapter 5. Perhaps the two most 
important variables affecting growth morphologies are vapor supersaturation 
and substrate temperature. The former influences the film nucleation rate, 
whereas the latter affects the growth rate. In concert they influence whether 
epitaxial films, platelets, whiskers, dendrites, coarse-grained polycrystals, 
fine-grained polycrystals, amorphous deposits, gas-phase powder, or some 
combination of these form. Thus, single-crystal growth is favored by low gas 
supersaturation and high substrate temperatures, whereas amorphous film 
formation is promoted at the opposite extremes. An example of the effect of 
substrate temperature on the structure of deposited Si films is shown in Fig. 
4-12 (Ref. 22). Decomposition of silane at temperatures of about 600 °C and 
below yields amorphous films with no detectable structure. Polysilicon de¬ 
posited from 600 to 650 °C has a columnar structure with grain sizes ranging 
from 0.03 to 0.3 ju,m and possesses a {110} preferred orientation. Larger Si 
crystallites form at higher temperatures, and eventually single-crystal film 
growth can be achieved at 1200 °C. 
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Figure 4-12. Morphology of poly-Si deposited from SiH 4 on an Si0 2 substrate: 
(top) columnar grains deposited above 650 °C, (middle) fine-grained poly Si; (bottom) 
partly amorphous structure deposited at 625 °C. Deposition temperatures below 600 °C 
produce an amorphous film. Courtesy of R. B. Marcus Bellcore Corp. (Reprinted with 
permission from John Wiley and Sons, from R. B. Marcus and T. T. Sheng, 
Transmission Electron Microscopy of Silicon VLSI Circuits and Structures , Copy¬ 
right © 1983, John Wiley and Sons). 
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4.6. CVD Processes and Systems 


The great variety of materials deposited by CVD methods has inspired the 
design and construction of an equally large number of processes and systems. 
These have been broadly categorized and described by such terms as low and 
high temperature, atmospheric and low pressure, cold and hot wall, closed and 
open in order to differentiate them. Incorporation of physical deposition 
features such as plasmas and evaporation sources has further enriched and 
expanded the number of potential CVD processes. Within a specific category, 
the variations in design and operating variables frequently make it difficult to 
compare performance of individual systems or reactors, even when depositing 
the same material. Regardless of process type, however, the associated equip¬ 
ment must have the following capabilities: 

1. Deliver and meter the reactant and diluent gases into the reactor. 

2. Supply heat to the substrates so that reaction and deposition can proceed 
efficiently. 

3. Remove the by-product and depleted gases. 

We begin with a discussion of atmospheric pressure processes and distin¬ 
guish between the low- and high-temperature reactors employed. More infor¬ 
mation exists on CVD applications and equipment for microelectronic device 
fabrication and the subsequent discussion will reflect this bias. 


4.6.1. Low-Temperature Systems 

In the fabrication of both Si bipolar and MOS integrated circuits there is an 
important need to deposit thin films of Si0 2 , phosphosilicate (PSG) and 
borophosphosilicate (BPSG) glasses, and silicon nitride films in order to 
insulate, passivate, or hermetically seal various parts of the underlying cir¬ 
cuitry. At the present time, an upper temperature limit that can be tolerated is 
- 450 °C because the A1 metallization used for device contacts and intercon¬ 
nections begins to react with Si beyond this point. Several types of atmospheric 
pressure, low-temperature reactors have been devised for the purpose of 
depositing these insulator films. They include resistance-heated rotary reactors 
of radial configuration and reactors featuring a close-spaced nozzle geometry. 
In the latter, gases impinge on wafers translated past the nozzles by a metal 
conveyer belt. Films of Si0 2 are deposited at 325-450 °C from SiH 4 + 0 2 
mixtures diluted with N 2 . Co-oxidation with PH 3 yields PSG, and PH 3 -I- 
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B 2 H 6 mixtures generate BPSG. As noted in Section 4.6.3, LPCVD processes 
have largely surpassed atmospheric CVD methods for depositing such films. 


4.6.2. High-Temperature Systems 

There is need to reduce semiconductor processing temperatures, but the growth 
of high-quality epitaxial thin films can only be achieved by high-temperature 
CVD methods. This is true of Si as well as compound semiconductors. 
High-temperature atmospheric systems are also extensively employed in metal¬ 
lurgical coating operations. The reactors can be broadly divided into hot-wall 
and cold-wall types. Hot-wall reactors are usually tubular in form, and heating 
is accomplished by surrounding the reactor with resistance elements. An 
example of such a reactor for the growth of single-crystal compound semicon¬ 
ductor films by the hydride process was given in Fig. 4-3. Higher temperatures 
are maintained in the source and reaction zones (~ 800-850 °C) relative to the 
deposition zone (700 °C). Prior to deposition, the substrate is sometimes 
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Figure 4-13. Schematic diagrams of reactors employed in epitaxial Si deposition: 
(top) horizontal; (lower left) pancake; (lower right) barrel. (Reprinted with permission 
from John Wiley and Sons, from S. M. Sze, Semiconductor Devices: Physics and 
Technology , Copyright © 1985, John Wiley and Sons). 
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etched by raising its temperatures to 900 °C. Provision for multiple tempera¬ 
ture zones is essential for efficient transport of matrix as well as dopant atoms. 
By programming flow rates and temperatures, the composition, doping level 
and layer thickness can be controlled, making it possible to grow complex 
multilayer structures for device applications. 

Cold-wall reactors are utilized extensively for the deposition of epitaxial Si 
films. Substrates are placed in good thermal contact with SiC-coated graphite 
susceptors, which can be inductively heated while the nonconductive chamber 
walls are air- or water-cooled. Three popular cold-wall reactor configurations 
are depicted in Fig. 4-13 (Ref. 23). Of note in both the horizontal and barrel 
reactors are the tilted susceptors. This feature compensates for reactant deple¬ 
tion, which results in progressively thinner deposits downstream as previously 
discussed. In contrast to the other types, the wafer substrates lie horizontal in 
the pancake reactor. Incoming reactant gases flow radially over the substrates 
where they partially mix with the product gases. Cold-wall reactors typically 
operate with H 2 flow rates of 100-200 (standard liters per minute) and 1 vol% 
of SiCl 4 . Silicon crystal growth rates of 0.2 to 3 fx m/min are attained under 
these conditions. Substantial radiant heat loss from the susceptor surface and 
consumption of large quantities of gas, 60% of which is exhausted without 
reacting at the substrate, limit the efficiency of these reactors. 

4.6.3. Low-Pressure CVD 

One of the more recent significant developments in CVD processing has been 
the introduction of low-pressure reactor systems for use in the semiconductor 
industry. Historically, LPCVD methods were first employed to deposit poly sil¬ 
icon films with greater control over stoichiometry and contamination problems. 
In practice, large batches of wafers, say 100 or more, can be processed at a 
time. This coupled with generally high deposition rates, improved film thick¬ 
ness uniformity, better step coverage, lower particle density, and fewer pinhole 
defects has given LPCVD important economic advantages relative to atmo¬ 
spheric CVD processing in the deposition of dielectric films. 

The gas pressure of - 0.5 to 1 torr employed in LPCVD reactors distin¬ 
guishes it from conventional CVD systems operating at 760 torr. To compen¬ 
sate for the low pressures, the input reactant gas concentration is correspond¬ 
ingly increased relative to the atmospheric reactor case. Low gas pressures 
primarily enhance the mass flux of gaseous reactants and products through the 
boundary layer between the laminar gas stream and substrates. According to 
Eq. 4 -31, the mass flux of the gaseous specie is directly proportional to D/d. 
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Since the diffusivity varies inversely with pressure, D is roughly 1000 times 
higher in the case of LPCVD. This more than offsets the increase in b, which 
is inversely proportional to the square root of the Reynolds number. In an 
LPCVD reactor, the gas flow velocity is generally a factor of 10-100 times 
higher, the gas density a factor of 1000 lower, and the viscosity unchanged 
relative to the atmospheric CVD case. Therefore, Re is a factor of 10 to 100 
times lower, and b is about 3 to 10 times larger. Because the change in D 
dominates that of <5, a mass-transport enhancement of over an order of 
magnitude can be expected for LPCVD. The increased mean-free path of the 
gas molecules means that substrate wafers can be stacked closer together, 
resulting in higher throughputs. When normalized to the same reactant partial 
pressure, LPCVD film growth rates exceed those for conventional atmospheric 
CVD. 

The commercial LPCVD systems commonly employ horizontal hot-wall 
reactors like that shown in Fig. 4-14. These consist of cylindrical quartz tubes 
heated by wire-wound elements. Large mechanical pumps as well as blower 
booster pumps are required to accommodate the gas flow rates employed—e.g., 
50-500 standard cm 3 /min at 0.5 torr—and maintain the required operating 
pressure. One significant difference between atmospheric and LPCVD systems 
concerns the nature of deposition on reactor walls. Dense adherent deposits 
accumulate on the hot walls of LPCVD reactors, whereas thinner, less 
adherent films form on the cooler walls of the atmospheric reactors. In the 
latter case, particulate detachment and incorporation in films is a problem, 
especially on horizontally placed wafers. It is less of a problem for LPCVD 
reactors where vertical stacking is employed. Typically, 100 wafers, 15 cm in 
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Figure 4-14. Schematic diagram of hot-wall reduced pressure reactor (From 
Ref. 24). 
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diameter, can be processed per hour in this reactor. In addition to polysilicon 
and dielectric films, silicides and refractory metals have been deposited by 
LPCVD methods. 

4.6.4. Plasma-Enhanced CVD 

In PECVD processing, glow discharge plasmas are sustained within chambers 
where simultaneous CVD reactions occur. The reduced-pressure environment 
utilized is somewhat reminiscent of LPCVD systems. Generally, the radio 
frequencies employed range from about 100 kHz to 40 MHz at gas pressures 
between 50 mtorr to 5 torr. Under these conditions, electron and positive-ion 
densities number between 10 9 and 10 12 /cm 3 , and average electron energies 
range from 1 to 10 eV. This energetic discharge environment is sufficient to 
decompose gas molecules into a variety of component species, such as elec¬ 
trons, ions, atoms, and molecules in ground and excited states, free radicals, 
etc. The net effect of the interactions among these reactive molecular frag¬ 
ments is to cause chemical reactions to occur at much lower temperatures than 
in conventional CVD reactors without benefit of plasmas. Therefore, previ¬ 
ously unfeasible high-temperature reactions can be made to occur on tempera¬ 
ture-sensitive substrates. 

In the overwhelming majority of the research and development activity in 
PECVD processing, the discharge is excited by an rf field. This is due to the 
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Figure 4-15. Typical cylindrical, radial flow, silicon nitride deposition reactor (From 
Ref. 26). 
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fact that most of the films deposited by this method are dielectrics, and dc 
discharges are not feasible. The tube or tunnel reactors employed can be 
coupled inductively with a coil or capacitively with electrode plates. In both 
cases, a symmetric potential develops on the walls of the reactor. High wall 
potentials are avoided to minimize sputtering of wall atoms and their incorpo¬ 
ration into growing films. 

A major commercial application of PECVD processing has been to deposit 
silicon nitride films in order to passivate and encapsulate completely fabricated 
microelectronic devices. At this stage the latter cannot tolerate temperatures 
much above 300 °C. A parallel-plate, plasma deposition reactor of the type 
shown in Fig. 4-15 is commonly used for this purpose. The reactant gases first 
flow through the axis of the chamber and then radially outward across rotating 
substrates that rest on one plate of an rf-coupled capacitor. This diode 
configuration enables a reasonably uniform and controllable film deposition to 
occur. The process is carried out at low pressures to take advantage of 
enhanced mass transport, and typical deposition rates of about 300 A/min are 
attained at power levels of 500 W. Silicon nitride is normally prepared by 
reacting silane with ammonia in an argon plasma, but a nitrogen discharge with 


Table 4-3. Physical and Chemical Properties of Silicon Nitride Films 
from SiH 4 + NH 3 


Property 

Si 3 N 4 

1 atm CVD 

900 °C 

Si 3 N 4 (H) 

LPCVD 

750 °C 

Si^N H z 

PECVD 

300 °C 

Density (g/cm 3 ) 

2.8-3.1 

2.9-3.1 

2.5-2.8 

Refractive index 

2.0-2.1 

2.01 

2.0-2.1 

Dielectric constant 

Dielectric breakdown 

6-7 

6-7 

6-9 

field (V/cm) 

10 7 

10 7 

6 x 10 6 

Bulk resistivity (0 cm) 

Stress at 23 °C on Si 

10 15 -10 17 

10 16 

10 15 

(dynes/cm 2 ) 

1.5 x 10 10 (T) 

10 10 (T) 

1 - 8 x 10 9 (C) 

Color transmitted 

None 


Yellow 

H 2 0 permeability 

Zero 


Low-none 

Thermal stability 

Excellent 


Variable > 400 °C 

Si/N ratio 

0.75 

0.75 

0.8-1.0 

Etch rate, 49% HF (23 °C) 

80 A/min 


1500-3000 A/min 

Na + penetration 

< 100 A 


< 100 A 

Step coverage 

Fair 


Conformal 


Note: T = tensile; C = compressive. 
Adapted from Refs. 24, 25. 
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Table 4-4. PECVD Reactants and Products, Deposition Temperatures, and Rates 


Deposit 

T (K) 

Rate (cm/sec) 

Reactants 

a-Si 

573 

10" 8 -10‘ 7 

SiH 4 ; SiF 4 -H 2 ; Si(ri-H 2 

c-Si 

673 

i0“ 8 -i<r 7 

SiH 4 -H 2 ;SiF 4 -H 2 ;Si(*)-H 2 

a-Ge 

673 

10" 8 -10" 7 

GeH 4 

c-Ge 

673 

10 s -10“ 7 

GeH 4 -H 2 ; Ge(j)-H 2 

a-B 

673 

10 _s -10“ 7 

B 2 H 6 ; BC1 3 -H 2 ; BBr 3 

a-P, c-P 

293-473 

10- 5 

P(j)-H 2 

As 

< 373 

10" 6 

AsH 3 ; As(j)-H 2 

Se, Te, Sb, Bi 

373 

10" 7 -10” 6 

Me-H 2 

Mo 



Mo(CO) 4 

Ni 



Ni(CO) 4 

C (graphite) 

1073-1273 

10“ s 

C(i)-H,;C(s)-N 2 

CdS 

373-573 

10" 6 

Cd-H 2 S 

Oxides 




Si0 2 

523 

10- 8 -10 _<1 

Si(OC 2 H 5 ) 4 ; SiH 4 -0 2 , N 2 0 

GeO 2 

523 

10-8-10- 6 

Ge(OC 2 H 5 ) 4 ; GeH 4 -0 2 , N 2 0 

Si0 2 /Ge0 2 

1273 

3 X 10“ 4 

SiCl 4 -GeCl 4 + 0 2 

ai 2 o 3 

523-773 

10" 8 -10" 7 

aici 3 -o 2 

Ti0 2 

473-673 

I0- 8 

TiCl 4 -0 2 ; metallorganics 

b 2 o 3 



b(oc 2 h 5 ) 3 -o 2 

Nitrides 




Si 3 N 4 (H) 

573-773 

10" 8 -10‘ 7 

SiH 4 -N 2 , NH 3 

AIN 

1273 

10“ 6 

aici 3 -n 2 

GaN 

873 

10“ 8 -10“ 7 

GaCI 3 -N 2 

TiN 

523-1273 

IO“ s -5 x lO -8 

TiCl 4 -H 2 + N 2 

BN 

673-973 


b 2 h 6 -nh 3 

P 3 N 5 

633-673 

5 x KT 6 

P(.s)-N 2 ; PH 3 -N 2 

Carbides 




SiC 

473-773 

10" 8 

SiH 4 -C n H m 

TiC 

673-873 

5 x 10“ 8 -10“ 6 

TiCl 4 -CH 4 + H 2 


673 

10“ 8 -10- 7 

b 2 h 6 -ch 4 


From Ref. 27. 


silane can also be used. As much as 25 al% hydrogen can be incorporated in 
plasma silicon nitride, which may, therefore, be viewed as a temai 7 solid 
solution. This should be contrasted with the stoichiometric compound Si 3 N 4 , 
formed by reacting silane and ammonia at 900 °C in an atmospheric CVD 
reactor. It is instructive to further compare the physical and chemical property 
differences in three types of silicon nitride, and this is done in Table 4-3. 
Although Si 3 N 4 is denser, more resistant to chemical attack, and has higher 
resistivity and dielectric breakdown strength, SiNH tends to provide better step 
coverage. 
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Some elements, such as carbon and boron, in addition to metals, oxides, 
nitrides, and silicides, have been deposited by PECVD methods. Operating 
temperatures and nominal deposition rates are included in Table 4-4. An 
important recent advance in PECVD relies on the use of microwave—also 
called electron cyclotron resonance (ECR)—plasmas. As the name implies, 
microwave energy is coupled to the natural resonant frequency of the plasma 
electrons in the presence of a static magnetic field. The condition for energy 
absorption is that the microwave frequency o) m (commonly 2.45 GHz) be equal 
to qB/m , where all terms were previously defined in connection with 
magnetron sputtering (Section 3.7.3). Physically, plasma electrons then un¬ 
dergo one circular orbit during a single period of the incident microwave. 
Whereas rf plasmas contain a charge density of — 10 10 cm -3 in a 10" 2 -to-l- 
torr environment, the ECR discharge is easily generated at pressures of 10" 5 
to 10" 3 torr. Therefore, the degree of ionization is about 1000 times higher 
than in the rf plasma. This coupled with low-pressure operation, controllability 
of ion energy, low-plasma sheath potentials, high deposition rates, absence of 
source contamination (no electrodes!), etc., has made ECR plasmas attractive 
for both film deposition as well as etching processes. A reactor that has been 
employed for the deposition of Si0 2 , A1 2 0 3 , SiN, and Ta 2 0 5 films is shown 
in Fig. 4-16. A significant benefit of microwave plasma processing is the 
ability to produce high-quality films at low substrate temperatures. 


MICROWAVE 2.45 GHz 



Figure 4-16. ECR plasma deposition reactor. (From Ref. 28, with permission from 
Noyes Publications). 
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4.6.5. Laser-Enhanced CVD 

Laser or, more generally, optical chemical processing involves the use of 
monochromatic photons to enhance and control reactions at substrates. Two 
mechanisms are involved during laser-assisted deposition, and these are illus¬ 
trated in Fig. 4-17. In the pyrolytic mechanism the laser heats the substrate to 
decompose gases above it and enhance rates of chemical reactions there. 
Pyrolytic deposition requires substrates that melt above the temperatures 
necessary for gas decomposition. Photolytic processes, on the other hand, 
involve direct dissociation of molecules by energetic photons. Ultraviolet light 
sources are required because many useful parent molecules (e.g., SiH 4 , 
Si 2 H 6 , Si 3 H 8 , N 2 0) require absorption of photons with wavelengths of less 
than 220 nm to initiate dissociation reactions. The only practical continuous- 
wave laser is the frequency-doubled Ar + at 257 nm with a typical power of 20 
mW. Such power levels are too low to enable high deposition rates over large 
areas but are sufficient to “write” or initiate deposits where the scanned light 
beam hits the substrate. Similar direct writing of materials has been accom¬ 
plished by pyrolytic processes. Both methods have the potential for local 
deposition of metal to repair integrated circuit chips. 

A number of metals such as Al, Au, Cr, Cu, Ni, Ta, Pt, and W have been 

LASER-ASSISTED DEPOSITION 



PHOTOLYTIC 
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Figure 4-17. Mechanisms of laser-assisted deposition. (Reproduced with permission 
from Ref, 29, © 1985 by Annual Reviews Inc.). 
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deposited through the use of laser processing. For photolytic deposition, 
organic metal dialkyl and trialkyls have yielded electrically conducting de¬ 
posits. Carbonyls and hydrides have been largely employed for pyrolytic 
depositions. There is frequently an admixture of pyrolytic and photolytic 
deposition processes occurring simultaneously with deep UV sources. Alterna¬ 
tively, pyrolytic deposition is accompanied by some photodissociation of 
loosely bound complexes if the light source is near the UV. 

Dielectric films have also been deposited in low-pressure photosensitized 
CVD processes (Ref. 30). The photosensitized reaction of silane and hydrazine 
yields silicon nitride films, and Si0 2 films have been produced from a gas 
mixture of SiH 4 , N 2 0, and N 2 . In Si0 2 , deposition rates of 150 A/min at 
temperatures as low as 50 °C have been reported (Ref. 23), indicating the 
exciting possibilities inherent in such processing. 


4.6.6. Metalorganic CVD (MOCVD) (Ref. 31) 

Also known as OMVPE (organometallic vapor phase epitaxy), MOCVD has 
presently assumed considerable importance in the deposition of epitaxial 
compound semiconductor films. Unlike the previous CVD variants, which 
differ on a physical basis, MOCVD is distinguished by the chemical nature of 
the precursor gases. As the name implies, metalorganic compounds like 
trimethyl-gallium (TMGa), trimethyl-indium (TMIn), etc, are employed. They 
are reacted with group V hydrides, and during pyrolysis the semiconductor 
compound forms; e.g., 

(CH 3 ) 3 Ga U) + AsH 3(g) - GaAs (J) + 3CH 4(g) (4-51) 

Group V organic compounds TMAs, TEAs (triethyl-arsenic), TMP, TESb, 
etc., also exist, so that all-organic pyrolysis reactions have been carried out. 
The great advantage of using metalorganics is that they are volatile at moder¬ 
ately low temperatures; there are no troublesome liquid Ga or In sources in the 
reactor to control for transport to the substrate. Carbon contamination of films 
is a disadvantage, however. Since all constituents are in the vapor phase, 
precise electronic control of gas flow rates and partial pressures is possible. 
This, combined with pyrolysis reactions that are relatively insensitive to 
temperature, allows for efficient and reproducible deposition. Utilizing com¬ 
puter-controlled gas exchange and delivery systems, epitaxial multilayer semi¬ 
conductor structures with sharp interfaces have been grown in reactors such as 
shown in Fig. 4-18. In addition to GaAs, other III-V as well as II-VI and 
IV-VI compound semiconductor films have been synthesized. Table 4-5 lists 
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FLASK 


Figure 4-18. Schematic diagram of a vertical atmospheric-pressure MOCVD reac¬ 
tor. (Reprinted with permission. From R. D. Dupuis, Science 226, 623, 1984). 


Table 4-5. Organo Metallic Precursors and Semiconductor Films 
Grown by MOCVD 


Compound 

Reactants 

Vapor Pressure* of 
OM precursor 

a b 

Growth Temperature 
( °C) 

AlAs 

TMA1 + AsH 3 

8.224 

2135 

700 

AIN 

TMA1 + NH 3 



1250 

GaAs 

TMGa + AsH 3 

8.50 

1824 

650-750 

GaN 

TMGa + NH 3 



800 

GaP 

TMGa + PH 3 



750 

GaSb 

TEGa + TMSb 

9.17 

2532 

500-550 



7.73 

1709 


InAs 

TEIn + AsH 3 



650-700 

InP 

TEIn + PH 3 



725 

ZnS 

DEZn + H 2 S 

8.28 

2190 


ZnSe 

DEZn + H 2 Se 




CdS 

DMCd + H 2 S 

7.76 

1850 


HgCdTe 

Hg + DMCd + DMTe 

7.97 

1865 


CdTe 

DMCd + DMTe 





*log P(torr) = a — b j T K 
Adapted from Ref, 31. 
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some films formed on insulating and semiconducting substrates together with 
corresponding reactants and film growth temperatures. 

Film growth rates (G) and compositions directly depend on gas partial 
pressures and flow rates (V). For Al^Ga^^As films, 


G = K{T)[P Ci V Ca + 2P m V m \, 

2-^AI^AI PciYo-d 


(4-51) 

(4-52) 


In these equations K{T) is a temperature-dependent constant, and the factor of 
2 enters because trimethyl-aluminum is a dimer. MOCVD has been particu¬ 
larly effective in depositing films for a variety of visible and long-wavelength 
lasers as well as quantum well structures. The use of these precursor gases is 
not only limited to semiconductor technology; volatile organo-Y, Ba, and Cu 
compounds have been explored in connection with the deposition of high-tem¬ 
perature superconducting films having the nominal composition YBa 2 Cu 3 0 7 . 


4.6.7. Safety 

The safe handling of gases employed in CVD systems is a concern of 
paramount importance. Because the reactant or product gases are typically 
toxic, flammable, pyrophoric, or corrosive, and frequently possess a combina¬ 
tion of these attributes, they present particular hazards to humans. Exposure of 
reactor hardware and associated gas-handling equipment to corrosive environ¬ 
ments also causes significant maintenance problems and losses due to down¬ 
time. Table 4-6 lists gases commonly employed in CVD processes together 
with some of their characteristics. A simple entry in the table does not 
accurately reflect the nature of the gas in practice. Silane, for example, more 
so than other gases employed in the semiconductor industry, has an ominous 
and unpredictable nature. It is stable but pyrophoric, so it ignites on contact 
with air. If it accumulates in a stagnant airspace, however, the resulting 
mixture may explode upon ignition. In simulation tests of leaks, high flow 
rates of silane have resulted in violent explosions. For this reason, silane 
cylinders are stored outside buildings in concrete bunkers. The safety problems 
are magnified in low-pressure processing where concentrated gases are used. 
For example, in the deposition of polysilicon, pure silane is used during 
LPCVD, whereas only 3% silane is employed in atmospheric CVD processing. 

Corrosive attack of gas-handling equipment (e.g., valves, regulators, piping) 
occurs in virtually all CVD systems. The problems are particularly acute in 
LPCVD processing because of the damage to mechanical pumping systems. 
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Table 4-6. Hazardous Gases Employed in CVD 







Bodily 

Gas 

Corrosive 

Flammable 

Pyrophoric 

Toxic 

Hazard 

Ammonia 

X 



X 

eye and 

(NH 3 ) 





respiratory 

irritation 

Arsine 


X 


X 

Anemia, 

(AsH 3 ) 





kidney damage 
death 

Boron Trichloride 

X 





(BC1 3 ) 

Boron Trifluoride 

X 





(BF 3 ) 

Chlorine 

X 



X 

Eye and 

(Cl 2 ) 





respiratory 

irritation 

Diborane 


X 

X 

X 

Respiratory 

(B 2 H 6 ) 

Dichlorosilane 

(SiH 2 Cl 2 ) 

X 

x 



irritation 

Germane 

(GeH 4 ) 

Hydrogen chloride 
(HC1) 

X 

X 


X 


Hydrogen fluoride 
(HF) 

Hydrogen 

(H 2 ) 

X 

X 



Severe burns 

Phosphine 


X 

X 

X 

Respiratory 

(PH 3 ) 





irritation, 

death 

Phosphorous 

pentachloride 

(PC1 5 ) 

Silane 

(SiH 4 ) 

Silicon 

X 

X 

X 

X 


tetrachloride 

(SiCl 4 ) 

Stibine 

(SbH 3 ) 

X 

X 


X 
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Since many reactors operate at high temperatures, the effluent gases are very 
hot and capable of farther downstream reactions in the pumping hardware. 
Furthermore, the exhaust stream generally contains corrosive species such as 
acids, water, oxidizers, unreacted halogenated gases, etc., in addition to large 
quantities of abrasive particulates. In semiconductor processing, for example, 
Si0 2 and Si 3 N 4 particles are most common. All of these products are ingested 
by the mechanical pumps, and the chamber walls become coated with precipi¬ 
tates or particulate crusts. The oils used are degraded through polymerization 
and incorporation of solids. The lubrication of moving parts and hardware is 
thus hampered, and they tend to corrode and wear out more readily. All of this 
is a small price to pay for the wonderful array of film materials that CVD has 
made possible. 



Exercises 



1- a. Write a balanced chemical equation for the CVD reaction that pro¬ 
duces A1 2 0 3 films from the gas mixture consisting of A1C1 3 4- C0 2 + 
H 2 . 

b. If a 2-^m-thick coating is to be deposited on a 2-cm-diameter substrate 
placed within a tubular reactor 50 cm long and 5 cm in diameter, 
calculate the minimum weight of A1C1 3 precursor required. 

c. Repeat parts (a) and (b) for VC films from a VC1 4 4- C 6 H 5 CH 3 + H 2 
gas mixture. 

2 . Consider the generic reversible CVD reaction 

A ^ B , + C *( 7 '2>7’ 1 ) 

J 2 

at 1 atm pressure (P A 4- P c = 1), where the free energy of the reaction is 

AG° = AH° - T AS°. Through consideration of the equilibria at T x 

and J 2 , 

a. derive an expression for A P A ~ P A (T ] ) — P A (T 2 ) as a function of T , 
AH, and A5. 

b. plot AP a as a function of AH. 

c. comment on the gas transport direction and magnitude as a function of 
the sign and value of A H. 

3- In growing epitaxial Ge films by the disproportionation reaction of Eq. 
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4-13, the following thermodynamic data apply: 

I 2 ( g) = 2I (g j AG° = —38 .47 cal/mole 

Ge (J) + 1 2 (g) = GeI 2 (g) AG° = - 1990 - 11.27 cal/mole 

Ge (5) -I- Gel 4{g) = 2GeI 2(g) AG° = 36300 — 57.57 cal/mole 

a. What is AG° for the reaction Ge (J) + 2I 2(g) = Gel 4(g) ? 

b. Suggest a reactor design. Which region is hotter; which is cooler? 

c. Roughly estimate the operating temperature of the reactor. 

d. Suggest how you would change the reactor conditions to deposit 
polycrystalline films. 

4 - a. At 1200 °C the following growth rates of Si films were observed using 
the indicated Si-Cl-H precursor gas. The same CVD reactor was 
employed for all gases. 


Precursor 

Growth Rate (/im/min) 

SiH 4 

1 

SiH 2 Cl 2 

0.5 

SiHCl 3 

0.3 

SiCl 4 

0.15 


b. The density of poly-Si nuclei on an Si0 2 substrate at 1000 °C was 
observed to be 10 10 cm -2 for SiH 4 , 5 x 10 7 cm' 2 for SiH 2 Cl 2 , and 
3 X 10 6 cm" 2 for SiHCl 3 . 

Are the observations made in (a) and (b) consistent? From what you know 
about these gases explain the two findings. 

5. Plot lnP^ci /^sici 4 ^H 2 vs * 1/7* K for the temperature range 800 to 1500 
K, using the results of Fig. 4-5. 

a. What is the physical significance of the slope of this Arrhenius plot? 

b. Calculate AH for the reaction given by Eq. 4-22a, using data in 
Fig. 4-4. 

6 - Assume you are involved in a project to deposit ZnS and CdS films for 
infrared optical coatings. Thermodynamic data reveal 

H 2 S (g) + Zn (5) -> ZnS (5) + H 2(g) 

AG = -76,400 + 82.17- 5.97 In T (cal/mole) 

2 * H 2S( g ) + Gd (g) ”*■ GdS (s) + H 2 (g) 

A G= -50,000 + 85.27 - 6.647 In 7 (cal/mole) 
a. Are these reactions endothermic or exothermic? 
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b. In practice, reactions 1 and 2 are carried out at 680 °C and 600 °C, 
respectively. From the vapor pressures of Zn and Cd at these tempera¬ 
tures, estimate the P\i 2 /PhiS rat *° f° r eac ^ reaction, assuming equi¬ 
librium conditions. 

c. Recommend a reactor design to grow either ZnS or CdS, including a 
method for introducing reactants and heating substrates. 

7. It is observed that when WF 6 gas passes over a substrate containing 
exposed areas of Si and Si0 2 : 

1. W selectively deposits over Si and not over Si0 2 . 

2. Once a continuous film of W deposits (i.e., - 100-150 A), the 
reaction is self-limiting and no more W deposits. 

Suggest a possible way to subsequently produce a thicker W deposit. 

8 . The disproportionation reaction Si + SiCl 4 = 2SiCI 2 (AG° = 83,000 + 
3.647 log 7 — 89.47 (cal/mole)) is carried out in a closed tubular 
atmospheric pressure reactor whose diameter is 15 cm. Deposition of Si 
occurs on a substrate maintained at 750 °C and located 25 cm away from 
the source, which is heated to 900 °C. Assuming thermodynamic equilib¬ 
rium prevails at source and substrate, calculate the flux of SiCl 2 trans¬ 
ported to the substrate if the gas viscosity is 0.08 cP. [Hint: See problem 
2 .] 

9- Find the stoichiometric formula for the following films: 

a. PECVD silicon nitride containing 20 at% H with a Si/N ratio of 1.2. 

b. LPCVD silicon nitride containing 6 at% H with a Si/N ratio of 0.8. 

c. LPCVD Si0 2 with a density of 2.2 g/cm 3 , containing 3 x 10 21 H 
atoms/cm 3 . 

10. Tetrachlorosilane diluted to 0.5% mole in H 2 gas flows through a 
12-cm-diameter, tubular, atmospheric CVD reactor at a velocity of 20 
cm/sec. Within the reactor is a flat pallet bearing Si wafers resting 
horizontally. If the viscosity of the gas is 0.03 cP at 1200 °C, 

a. what is the Reynolds number for the flow? 

b. estimate the boundary layer thickness at a point 5 cm down the pallet. 

c. If epitaxial Si films deposit at a rate of 1 ^tm/min, estimate the 
diffusivity of Si through the boundary layer. 

o 

11 . Polysilicon deposits at a rate of 30 A/min at 540 °C. What deposition 
rate can be expected at 625 °C if the activation energy for film deposition 
is 1.65 eV? 
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12- Consider a long tubular CVD reactor in which one-dimensional steady- 
state diffusion and convection processes occur together with a homoge¬ 
neous first-order chemical reaction. Assume the concentration C(*) of a 
given species satisfies the ordinary differential equation 

d 2 C dC 

D— - V —-KC = 0, 
dx dx 

where K is the chemical rate constant and x is the distance along the 
reactor. 

a. If the boundary conditions are C(x = 0) = 1 and C(x = 1 m) = 0. 
derive an expression for C(x). 

b. If C(.v = 0) = 1 and dC/dx(x = 1 m) = 0. derive an expression for 
C(.v). 

c. Calculate expressions for the concentration profiles if D = 1000 
cnr/sec, v = 100 cm/sec. and K = 1 sec *. [Hint: A solution to the 
differential equation is exp ax. where a is a constant.] 

13, Select any film material (e.g.. semiconductor, oxide, nitride, carbide 
metal alloy, etc.) that has been deposited or grown by both PVD and 
CVD methods. In a report, compare the resultant structures, stoichiome¬ 
tries. and properties. The Journo! of Vacuum Science and Technology 
and Thin Solid Films are good references for such information. 
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Film Formation 
and Structure 


5.1- Introduction 


Interest in thin-fiim formation processes dates at least to the 1920s. During 
research at the Cavendish Laboratories in England on evaporated thin films, 
the concept of formation of nuclei that grew and coalesced to form the film was 
advanced (Ref. 1). All phase transformations, including thin-film formation, 
involve the processes of nucleation and growth. During the earliest stages of 
film formation, a sufficient number of vapor atoms or molecules condense and 
establish a permanent residence on the substrate. Many such film birth events 
occur in this so-called nucleation stage. Although numerous high-resolution 
transmission electron microscopy investigations have focused on the early 
stages of film formation, it is doubtful that there is a clear demarcation 
between the end of nucleation and the onset of nucleus growth. The sequence 
of nucleation and growth events can be described with reference to the 
micrographs of Fig. 5-1. Soon after exposure of the substrate to the incident 
vapor, a uniform distribution of small but highly mobile clusters or islands is 
observed. In this stage the prior nuclei incorporate impinging atoms and 
subcritical clusters and grow in size while the island density rapidly saturates. 
The next stage involves merging of the islands by a coalescence phenomenon 
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25 A IOO A 300 A 



700 ii 900 1 

Figure 5-1. Nucleation, growth and coalescence of Ag films on (111) NaCl sub¬ 
strates. Corresponding diffraction patterns are shown. (Courtesy of R. W. Vook. 
Reprinted with permission from the Metals Society, from R. W. Vook, Int. Metals 
Rev. 27, 209(1982). 
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STRANSKI - KRASTANOV 
Figure 5-2. Basic modes of thin-fiim growth. 


that is liquidlike in character especially at high substrate temperatures. Coales¬ 
cence decreases the island density, resulting in local denuding of the substrate 
where further nucleation can then occur. Crystallographic facets and orienta¬ 
tions are frequently preserved on islands and at interfaces between initially 
disoriented, coalesced particles. Coalescence continues until a connected net¬ 
work with unfilled channels in between develops. With further deposition, the 
channels fill in and shrink, leaving isolated voids behind. Finally, even the 
voids fill in completely, and the film is said to be continuous. This collective 
set of events occurs during the early stages of deposition, typically accounting 
for the first few hundred angstroms of film thickness. 

The many observations of film formation have pointed to three basic growth 
modes: (1) island (or Volmer-Weber), (2) layer (or Frank-van der Merwe), 
and (3) Stranski-Krastanov, which are illustrated schematically in Fig. 5-2. 
Island growth occurs when the smallest stable clusters nucleate on the substrate 
and grow in three dimensions to form islands. This happens when atoms or 
molecules in the deposit are more strongly bound to each other than to the 
substrate. Many systems of metals on insulators, alkali halide crystals, graphite, 
and mica substrates display this mode of growth. 

The opposite characteristics are displayed during layer growth. Here the 
extension of the smallest stable nucleus occurs overwhelmingly in two dimen¬ 
sions resulting in the formation of planar sheets. In this growth mode the atoms 
are more strongly bound to the substrate than to each other. The first complete 
monolayer is then covered with a somewhat less tightly bound second layer. 
Providing the decrease in bonding energy is continuous toward the bulk crystal 
value, the layer growth mode is sustained. The most important example of this 
growth mode involves single-crystal epitaxial growth of semiconductor films, a 
subject treated extensively in Chapter 7. 
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The layer plus island or Stranski-Krastanov (S.K.) growth mechanism is an 
intermediate combination of the aforementioned modes. In this case, after 
forming one or more monolayers, subsequent layer growth becomes unfavor¬ 
able and islands form. The transition from two- to three-dimensional growth is 
not completely understood, but any factor that disturbs the monotonic decrease 
in binding energy characteristic of layer growth may be the cause. For 
example, due to Film-substrate lattice mismatch, strain energy accumulates in 
the growing film. When released, the high energy at the deposit-intermediate- 
layer interface may trigger island formation. This growth mode is fairly 
common and has been observed in metal-metal and metal-semiconductor 
systems. 

At an extreme far removed from early film formation phenomena is a 
regime of structural effects related to the actual grain morphology of polycrys¬ 
talline films and coatings. This external grain structure together with the 
internal defect, void, or porosity distributions frequently determines many of 
the engineering properties of films. For example, columnar structures, which 
interestingly develop in amorphous as well as polycrystalline films, have a 
profound effect on magnetic, optical, electrical, and mechanical properties. In 
this chapter we discuss how different grain and deposit morphologies evolve as 
a function of deposition variables and how some measure of structural control 
can be exercised. Modification of the film structure through ion bombardment 
or laser processing both during and after deposition has been a subject of 
much research interest recently and is treated in Chapters 3 and 13. Subse¬ 
quent topics in this chapter are: 

5.2 Capillarity Theory 

5.3 Atomistic Nucleation Processes 

5.4 Cluster Coalescence and Depletion 

5.5 Experimental Studies of Nucleation and Growth 

5.6 Grain Structure of Films and Coatings 

5.7 Amorphous Thin Films 

References 1-5 are recommended sources for much of the subject matter in 
this chapter. 


5-2. Capillarity Theory 


5.2.1. Thermodynamics 

Capillarity theory possesses the mixed virtue of yielding a conceptually simple 
qualitative model of film nucleation, which is, however, quantitatively inaccu- 
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rate. The lack of detailed atomistic assumptions gives the theory an attractive 
broad generality with the power of creating useful connections between such 
variables as substrate temperature, deposition rate, and critical film nucleus 
size. An introduction to the thermodynamic aspects of homogeneous nucle- 
ation was given on p. 40 and is worth reviewing. In a similar spirit, we now 
consider the heterogeneous nucleation of a solid film on a planar substrate. 
Film-forming atoms or molecules in the vapor phase are assumed to impinge 
on the substrate, creating aggregates that either tend to grow in size or 
disintegrate into smaller entities through dissociation processes. 

The free-energy change accompanying the formation of an aggregate of 
mean dimension r is given by 

AG = a 3 r 3 AG K + a,r 2 y v/ + a 2 r 2 y fs - a 2 r 2 y su . (5-1) 

The chemical free-energy change per unit volume, A G v , drives the condensa¬ 
tion reaction. According to Eq. 1-39, any level of gas-phase supersaturation 
generates a negative A G v without which nucleation is impossible. There are 
several interfacial tensions, 7 , to contend with now, and these are identified by 
the subscripts /, s , and u representing film, substrate, and vapor, respec¬ 
tively. For the cap-shaped nucleus in Fig. 5-3, the curved surface area (fl,/* 2 ), 
the projected circular area on the substrate ( a 2 r 2 ), and the volume (a 3 r 3 ) are 
involved, and the respective geometric constants are a x = 2 7 r(l - cos 0), 
a 2 = tt sin 2 0, a 3 = 7t(2 - 3 cos 0 4- cos 3 0)/3. Consideration of the mechani¬ 
cal equilibrium among the interfacial tensions or forces yields Young’s equa¬ 
tion 

~Ysv = 7/s + 7„/COS d. (5-2) 

Therefore, the contact angle 0 depends only on the surface properties of the 
involved materials. The three modes of film growth can be distinguished on the 

DEPOSITION DESORPTION 



Figure 5-3. Schematic of basic atomistic processes on substrate surface during vapor 
deposition. 
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basis of Eq. 5-2. For island growth, 6 > 0, and therefore 

Tsu ^ yfs yvf- (^ _ ^) 

For layer growth the deposit “wets” the substrate and 0 = 0. Therefore, 

Ysu = y f s + ~Vuf ■ (5-4) 

A special case of this condition is ideal homo- or autoepitaxy . Because the 
interface between film and substrate essentially vanishes, y fs = 0. Lastly, for 
S.K. growth, 

Isu > Ifs + 7„/ • (5-5) 


In this case, the strain energy per unit area of film overgrowth is large with 
respect to y v/ , permitting nuclei to form above the layers. In contrast, a film 
strain energy that is small compared with y vf is characteristic of layer growth. 

Returning now to Eq. 5-1, we note that any time a new interface appears 
there is an increase in surface free energy, hence the positive sign for the first 
two surface terms. Similarly, the loss of the circular substrate-vapor interface 
under the cap implies a reduction in system energy and a negative contribution 
to AG. The critical nucleus size r* (i.e., the value of r when d AG/dr = 0) 
is given by differentiation, namely, 


* = — 2( g i7t,/ + g 2 7/j ~ 

3 a 3 AG v 

Correspondingly, AG evaluated at r = r* is 

4 ( a iT,/ + a 2 y fs - a 2 y sv f 
27 a\ AG\ 


(5-6) 


(5-7) 


Both r * and AG* scale in the manner shown in Fig. 1-19. An aggregate 
smaller in size than r* disappears by shrinking, lowering AG in the process. 
Critical nuclei grow to supercritical dimensions by further addition of atoms, a 
process that similarly lowers AG. In heterogeneous nucleation the accommo¬ 
dating substrate catalyzes vapor condensation and the energy barrier AG* 
depends on the contact angle. After substitution of the geometric constants, it 
is easily shown that AG* is essentially the product of two factors; i.e., 

2 - 3 cos $ 4- cos 3 0 \ 

-j-)■ (S'®) 

The first is the value for AG* derived for homogeneous nucleation. It is 
modified by the second term, a wetting factor that has the value of zero for 
6 = 0 and unity for 6 = 180°. When the film wets the substrate, there is no 
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barrier to nucleation. At the other extreme of dewetting, AG* is maximum and 
equal to that for homogeneous nucleation. 

The preceding formalism provides a generalized framework for inclusion of 
other energy contributions. If, for example, the film nucleus is elastically 
strained throughout because of the bonding mismatch between film and sub¬ 
strate, then a term a 3 r 3 AG 5 , where A G s is the strain energy per unit volume, 
would be appropriate. In the calculation for AG*, the denominator of Eq. 5-7 
would then be altered to 21aj(AG v + AG^) 2 . Because the sign of AG y is 
negative while AG s is positive, the overall energy barrier to nucleation 
increases in such a case. If, however, deposition occurred on an initially 
strained substrate—i.e., one with emergent cleavage steps or screw disloca¬ 
tions—then stress relieval during nucleation would be manifested by a reduc¬ 
tion of AG*. Substrate charge and impurities similarly influence AG* by 
affecting terms related to either surface and volume electrostatic, chemical, 
etc., energies. 


5.2.2. Nucleation Rate 


The nucleation rate is a convenient synthesis of terms that describes how many 
nuclei of critical size form on a substrate per unit time. Nuclei can grow 
through direct impingement of gas phase atoms, but this is unlikely in the 
earliest stages of film formation when nuclei are spaced far apart. Rather, the 
rate at which critical nuclei grow depends on the rate at which adsorbed 
monomers (adatoms) attach to it. In the model of Fig. 5-3, energetic vapor 
atoms that impinge on the substrate may immediately desorb, but usually they 
remain on the surface for a length of time r s given by 


1 ^dcs 

r * = _exp Tr 

v kT 


(5-9) 


The vibrational frequency of the adatom on the surface is v (typically 10' 2 
sec 1 ), and £ dcs is the energy required to desorb it back into the vapor. 
Adatoms, which have not yet thermally accommodated to the substrate, 
execute random diffusive jumps and, in the course of their migration, may 
form pairs with other adatoms, or attach to larger atomic clusters or nuclei. 
When this happens, it is unlikely that these atoms will return to the vapor 
phase. Changes in £ des are particularly expected at substrate heterogeneities, 
such as cleavage steps or ledges where the binding energy of adatoms is 
greater relative to a planar surface. The proportionately large number of 
atomic bonds available at these accommodating sites leads to higher E des 
values. For this reason, a significantly higher density of nuclei is usually 
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observed near cleavage steps and other substrate imperfections. The presence 
of impurities similarly alters in a complex manner, depending on type and 
distribution of atoms or molecules involved, 

We now exploit some of these microscopic notions in the capillarity theory 
of the nucleation rate TV. Reproducing Eq. 1-41, we obtain the expression 
for TV: 

TV = N*A*a) nuclei/cm 2 -sec. (5-10) 

(The Zeldovich factor, included in other treatments, is omitted here for 
simplicity.) Based on the thermodynamic probability of existence, the equilib¬ 
rium number of nuclei of critical size per unit area of substrate is given by 

TV* = n s cx p - A G*/kT. (5-11) 

The quantity n 5 represents the total nucleation site density. A certain number 
of these sites are occupied by adatoms whose surface density, n a , is given by 
the product of the vapor impingement rate (Eq. 2-8) and the adatom lifetime, 
or 

n Q = t s PN a / \/2 ttMRT . (5-12) 

Surrounding the cap-shaped nucleus of Fig. 5-3 are adatoms poised to attach to 
the circumferential belt whose area is 

A* = 2 7rr*a 0 sin 6 . (5-13) 

Quantities r* and 0 were defined previously, and a 0 is an atomic dimension. 

Lastly, the impingement rate onto area A * requires adatom diffusive jumps 
on the substrate with a frequency given by vexp - E s /kT , where E s is the 
activation energy for surface diffusion. The overall impingement flux is the 
product of the jump frequency and n a , or 

r s PN A vexp - E s jkT (cm _2 sec _i ) 
v2 ttMRT 

There is no dearth of adatoms that can diffuse to and be captured by the 
existing nuclei. During their residence time, adatoms are capable of diffusing a 
mean distance X from the site of incidence given by 

X=y/2D^ S . (5-15) 

The surface diffusion coefficient D s is essentially 

D s = (\/2)a\v exp - E s /kT 

and therefore 



(5-14) 


X = a 0 exp 


2kT 


(5-16) 
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Large values of E *. coupled with small values of E s serve to extend the 
nucleus capture radius. 

Upon substitution of Eqs. 5-11, 5-13, and 5-14 in Eq. 5-10, we obtain 

PN a E des - E s - AG* 

N = 2 7rr*a 0 sin 6— 7 ==n s exp -—-. (5-17) 

0 VIttMRT s y kT v ' 

The nucleation rate is a very strong function of the nucleation energetics, 
which are largely contained within the term AG*. It is left to the reader to 
develop the steep dependence of N on the vapor supersaturation ratio. As 
noted previously, a high nucleation rate encourages a fine-grained, or even 
amorphous, structure, whereas a coarse-grained deposit develops from a low 
value of N. 

5.2.3. Nucleation Dependence on Substrate Temperature and 
Deposition Rate 

Substrate temperature and deposition rate R (atoms/cm 2 -sec) are among the 
chief variables affecting deposition processes. Calculating their effect on r* 
and AG* is both instructive and simple to do. First we define 

kT R 

* g -=-T (5 - ,8) 

by analogy to Eq. 1-38, where R e is the equilibrium evaporation rate from the 
film nucleus at the substrate temperature 7, and fl is the atomic volume. 
Assuming an inert substrate, i.e., y vf = y fs , direct differentiation of Eq. 5-6 
yields (Ref. 3) 


M'*\ 2 

y v/ (dA GJdT) - (a, + a 2 ) A G v {dy uf /dT) 

\dT) k 3 

a 3 A Gy 


Assuming typical values, y v j- = 1000 ergs/cm 2 and dy vf /dT = -0.5 
erg/cm 2 -K. An estimate for d A GJdT is the entropy change for vaporization, 
which is roughly 8 X 10 7 ergs/cm 3 -K for many metals. If |AGJ < 1.6 X 
10 11 ergs/cm 3 , then direct substitution gives 

(br*/dT) k > 0. (5-20) 


Similarly, by the same assumptions and arguments, 

(aAG*/a7)^ > 0. 


(5-21) 
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It is also simple to show that 

{dr*/dR) T < 0. (5-22) 

Direct chain-rule differentiation using Eqs. 5-6 and 5-18 yields 
dr* / dr* \ldAG y \ / r* \ I kT \ 

Jr ~ \ JIcTy) \ Jr ) = (" ag ^ ) \ Jr )' 

Since A G v is negative, the overall sign is negative. Similarly, it is easily 
shown that 

(dAG*/dR) T <0. (5-23) 

The preceding four inequalities have interesting implications and summarize 
a number of common effects observed during film deposition. From Eq. 5-20 
we note that a higher substrate temperature leads to an increase in the size of 
the critical nucleus. Also a discontinuous island structure is predicted to persist 
to a higher average coverage than at low substrate temperatures. The second 
inequality (Eq. 5-21) suggests that a nucleation barrier may exist at high 
substrate temperatures, whereas at lower temperatures it is reduced in magni¬ 
tude. Also because of the exponential dependence of N on AG*, the number 
of supercritical nuclei decreases rapidly with temperature. Thus, a continuous 
film will take longer to develop at higher substrate temperatures. From Eq. 
5-22, it is clear that increasing the deposition rate results in smaller islands. 
Because AG* is also reduced, nuclei form at a higher rate, suggesting that a 
continuous film is produced at lower average film thickness. If, for the 
moment, we associate a large r* and AG* with being conducive to large 
crystallites or even monocrystal formation, this is equivalent to high substrate 
temperatures and low deposition rates. Alternatively, low substrate tempera¬ 
tures and high deposition rates yield polycrystalline deposits. These simple 
guidelines summarize much practical deposition experience in a nutshell. A 
plot of these structural zones as a function of deposition rate and substrate 
temperature is shown in Fig. 5-4 for Cu films deposited on a (111) NaCl 
substrate. Maps of this sort have been graphed for other film-substrate 
combinations. In semiconductor systems a regime of amorphous film growth is 
additionally observed when R is large and T is low. 

Despite the qualitative utility of capillarity theory, it is far from being 
quantitatively correct. For example, let us estimate the value of r* for a typical 
metal film at 300 K. To simplify matters, the homogeneous nucleation for¬ 
mula, r* = -2y/AG K , is used, where A G v = - kT/VL In P v /I\ (Eq. 
1-38). Assuming D = 20 X 10 _24 cm 3 /at, y = 1000 ergs/cm 2 , P v = 10 3 torr, 
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Figure 5-4. Dependence of microstructure on deposition rate and substrate tempera¬ 
ture for Cu films on (111) NaCl. (Some substrate temperatures are noted.) (Reprinted 
with permission from the Metals Society, from R. W. Vook, Int. Metals Rev. 27, 209, 
1982). 

and = 10' 10 torr, then 

- 2 ( 1000 ) 

r* = -rr 7 ---ttt— 7 -—- — = 6 x 10 8 cm. 

-1.38 x 10“ 6 (300/20 x 10“ 24 )In(10 3 /10“ 10 ) 

A nucleus of this size contains just a few atoms, and it is therefore doubtful 
that continuum concepts like surface tension and nucleus radius have much 
significance in such a case. It is more realistic to interpret vapor condensation 
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phenomena in terms of a theory of heterogeneous nucleation based on an 
atomistic model. This is considered in the next section. 


5 . 3 . Atomistic Nucleation Processes 


5.3.1. The Walton - Rhodin Theory 


Atomistic theories of nucleation describe the role of individual atoms and 
associations of small numbers of atoms during the earliest stages of film 
formation. An important milestone in the atomistic approach to nucleation was 
the theory advanced by Walton et ai. (Ref. 6), which treated clusters as 
macromolecules and applied concepts of statistical mechanics in describing 
them. They introduced the critical dissociation energy # is /+ , defined to be that 
required to disintegrate a critical cluster containing / atoms into i separate 
adatoms. The critical concentration of clusters per unit area of size /, TV,*, is 
then given by 


AL 




E it 

exp *r 


(5-24) 


which expresses the chemical equilibrium between the clusters and monomers. 
In Eq. 5-24, E im may be viewed as the negative of a cluster formation energy, 
n 0 is the total density of adsorption sites, and iV, is the monomer density. The 
latter, by analogy to Eq. 5-12, is given by N, = Rt s . Hence, 

= Rv-'expE^/kT. (5-25) 


Lastly, the critical monomer supply rate is essentially given by the vapor 
impingement rate and the area over which adatoms are capable of diffusing 
before desorbing. Therefore, through Eq. 5-16, 

rx 2 = /?ffo ex P • ( 5 ~ 26 ) 

By combining Eqs. 5-24-5-26, we obtain the critical nucleation rate (cm 2 
sec” ') 


AL = Rain 


o"o 


R 




exp 


(/* + !)£*, - E s + E„ 


kT 


(5-27) 


This expression for the nucleation rate of small clusters has been central to 
subsequent theoretical treatments and in variant forms has also been used to 
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test much experimental data. It has the advantage of expressing the nucleation 
rate in terms of measurable parameters rather than macroscopic quantities such 
as AG*, 7 , or 6 , which are not known with certainty, nor are they easily 
estimated in the capillarity theory. The uncertainties are now in /* and 

One of the important applications of this theory has been to the subject of 
epitaxy, where the crystallographic geometry of stable clusters has been related 
to different conditions of supersaturation and substrate temperature. In Fig. 
5-5, experimental data for Ag on (100) NaCl are shown together with the 
atomistic model of evolution to stable clusters upon attachment of a single 
adatom. At high supersaturations and low temperatures, the nucleation rate is 
frequently observed to depend on the square of the deposition rate; i.e., 
N* oc /? 2 , indicating that /* = 1. This means that a single adatom is, in effect, 
the critical nucleus. At higher temperatures, two or three atom nuclei are 
critical, and the stable clusters now assume the planar atomic pattern sugges¬ 
tive of (111) or (100) packing, respectively. Epitaxial films would then evolve 
over macroscopic dimensions provided the original nucleus orientation were 
preserved with subsequent deposition and cluster impingement. 

A thermally activated nucleation rate whose activation energy depends on 
the size of the critical nucleus is predicted by Eq. 5-27. This suggests the 
existence of critical temperatures where the nucleus size and orientation 
undergo change. For example, the temperture T 1 _ 2 at which there is a 



1000/T(K) 

Figure 5-5. Nucleation rate of Ag on (100) NaCl as a function of temperature. Data 
for three different deposition rates are plotted. Also shown are smallest stable epitaxial 
clusters corresponding to critical nuclei. (From Ref. 6). 
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transition from a one- to two-atom nucleus is given by equating the respective 
nucleation rates: 

After substitution and some algebra, we have 


^des + 

k ln(/?/ vn 0 ) 


(5-28a) 


or 


R 


un 0 e\p - 


^des + ^2 

kT \~ 2 


(5-2 8b) 


Equation 5-28b can be used to analyze the data in Fig. 5-4. The transition of 
polycrystalline Cu to single-crystal Cu Films can be thought of as the onset of 
(111) epitaxy (/* = 2) from atomic nuclei (/* = 1). The Arrhenius plot of the 
line of demarcation between mono- and polycrystal deposits yields an activa¬ 
tion energy of E des + E 2 equal to 1.48 eV. At a deposition rate of 8.5 x 10 14 
atoms/cm 2 -sec or 1 A/sec, and vn 0 = 6.9 x 10 27 atoms/cm 2 -sec\ the epitax¬ 
ial transition temperature can be calculated to be 577 K. Equations similar to 
5-28 can be derived for transitions between /* = 1 and /* = 3 or /* = 2 and 
/* = 3, etc. 

It is instructive to compare these estimates of epitaxial growth temperatures 
with ones based on surface-diffusion considerations. For smooth layer-by-layer 
growth on high-symmetry crystal surfaces, adatoms must reach growth ledges 



Figure 5-6. Lattice ( D L ) and surface diffusivities ( D s ) as a function of T M /T for 
metals, semiconductors, and alkali halides. (From Ref. 7), 
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by diffusional hopping before other islands nucleate. Atoms must, therefore, 
migrate a distance of some 100-1000 atoms, the typical terrace width on 
well-oriented surfaces during the monolayer growth time of 0.1-1 sec. By Eq. 
5-15 a surface diffusivity greater than - 10“ 8 cm 2 /sec is, therefore, required 
irrespective of the material deposited. This means a different critical epitaxial 
growth temperature (T £ ), depending on whether semiconductor, metal, or 
alkali halide films are involved. In the Arrhenius plot of Fig. 5-6, D s values 
are graphed versus T M / 7, where T m is the melting point. Similar relation¬ 
ships for lattice, grain-boundary, and dislocation diffusivities will be intro¬ 
duced in Chapter 8 . At the critical value of D s = 10 “ 8 , Fig. 5-6 indicates that 
T e ~ 0.57 M , ~ 0.37 M , and - 0.17^ for layer growth on group IV semicon¬ 
ductors, metals, and alkali halides, respectively. These temperatures agree 
qualitatively with experimental observations. 

5.3.2. Kinetic Models of Nucleation 

More recent approaches to the modeling of nucleation processes have stressed 
the kinetic behavior of atoms and clusters containing a small number of atoms. 
Rate equations similar to those describing the kinetics of chemical reactions are 
used to express the time rate of change of cluster densities in terms of the 
processes that occur on the substrate surface. Because these theories and 
models are complex mathematically as well as physically, the discussion will 
stress the results without resorting to extensive development of derivations. It 
is appropriate to start with the fate of the mobile monomers. If coalescence is 
neglected, then 

dN N 00 

— 77 - = *-- - ^i ^, 2 - N, K,Nj. (5-29) 

The equation states that the time rate of change of the monomer density is 
given by the incidence rate, minus the desorption rate, minus the rate at which 
two monomers combine to form a dimer. This latter term follows second-order 
kinetics with a rate constant K x . The last term represents the loss in monomer 
population due to their capture by larger clusters containing two or more 
atoms. There are similar equations describing the population of dimer, trimer, 
etc., clusters as they interact with monomers. The general form of the rate 
equation for clusters of size / is 

dN,/dt = K,_,N x N,_,- KjN x N ,, (5-30) 

where the first term on the right expresses their increase by attachment of 
monomers to smaller (/ — l)-sized clusters, and the second term their decrease 
when they react with monomers to produce larger (/ + l)-sized clusters. There 
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are / of these coupled rate equations to contend with, each one of which 
depends on direct impingement from the vapor as well as desorption through 
their link back to Eq. 5-25. Inclusion of diffusion terms, i.e., d 2 N i /dx 2 J 
enables change in cluster shape to be accounted for. When cluster mobility and 
coalescence are also taken into account, a fairly complete chronology of 
nucleation events emerges but at the expense of greatly added complexity. 

Transient as well as steady-state (i.e., where dNJdt = 0) solutions have 
been obtained for the preceding rate equations for a variety of physical 
situations and the results are summarized below starting with the case for 
which /* = 1. 


1. The nucleation and growth kinetics for the one-atom critical nucleus has 
been treated under simplifying assumptions by Robinson and Robins (Ref. 8). 
They divided the deposition conditions into two categories. At high tempera¬ 
tures and/or low deposition rates, the reevaporation rate from the surface will 
control the adatom density and exceed the rate of diffusive capture into 
growing nuclei. Here the adsorption-desorption equilibrium is rapidly estab¬ 
lished where N x = Rr s , and incomplete condensation is said to occur. When 
reevaporation is not important, however, because the energy to desorb adatoms 
( E des ) is large, then we speak of complete condensation. This regime domi¬ 
nates at low tempertures and/or high deposition rates where the monomer 
capture rate by growing nuclei exceeds the rate at which they are lost due to 
reevaporation. These same condensation regimes also refer to situations where 
/* > 1. The division between these high- and low-temperature regions is not 
abrupt like r,_ 2 (Eq. 5-28a). It can be characterized by a dividing tempera¬ 
ture T d , calculated to be 


_ = 2 Es - 3£ dcs 

D k\n[(Ca 2 /t3){R/vn 0 )] ' 


(5-31) 


where C is the number of pair formation sites (C = 4 for a square lattice), and 
a and 0 are dimensionless constants with typical values of — 0.3 and 4, 
respectively. Relatively simple closed-form analytic expressions for the time 
(/) dependence of the transient density of stable nuclei N(t) as well as the 
saturation value of N(t = oo) or N s in terms of measurable quantities were 
obtained at temperatures above or below T D . 

For T> T d , 


N(t) ~ A^tanh 


K(0)t \ 

I 


(5-32) 


^s = 


Cn 0 


1/2 

1/2 exp 


■^dcs 

2kT' 


(5-33) 
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For T < T D , 


N(t) = N s \ 1 - exp - 3 rj 


N(0)t 


1/3 


*S = 


Cn 2 c 

CL&V 


2 \ 1/3 


/? 1/3 exp 


3 kT 


Clearly, as t -* oo, N(oo) -> N s . For all temperatures. 


., , C/? 2 

N( 0) = -exp 




2^des ~ Es 
kT 


(5-34) 

(5-35) 


(5-36) 


Only N( 0), the initial nucleation rate (i.e., /V(0) = dN(t)/dt at t = 0) and 
rj, which is equal to (rt 0 /a)e\p(E s - E 6cs /kT), have not been defined 
before. 

The equations predict that N(t) increases with time, eventually saturating at 
the value N s . In practice, the stable nucleus density is observed to increase 
approximately linearly with deposition time, then saturate at values ranging 
from - 10 9 to 10 12 , depending on deposition rate and substrate temperature. 
With further deposition, N decreases due to nucleus coalescence. Further¬ 
more, N s is larger the lower the substrate temperature, as shown schematically 
in Fig. 5-7. This is an obvious consequence of Eqs. 5-33 and 5-35. When the 
temperature range spanned exceeds T D , N s data as a function of T directly 
yield Zs dcs /2 on an Arrhenius plot. Similarly, E s /3 can be extracted from 
such data when T <T D . In this manner, values of Sdcs and E s for noble 
metal-alkali halide systems have been obtained and the results summarized in 
Table 5-1. An experimental verification of Eq. 5-36 is shown in Fig. 5-8. That 



N s (T i) 
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Figure 5-7. Schematic dependence of N(t) with time and substrate temperature. 
(T 4 > T 3 > T 2 > T,) From Ref. 4. 
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Table 5-1. Nucleation Parameter Values for Noble Metal-Alkali Halide Systems 


System 

^dcs (eV) 

Es (cV) 

v (10 12 /sec) 

Au-NaCl 

0.70 

0.29 

1.65 

Au-KCl 

0.68 

0.25 

1.26 

Au-KBr 

0.83 

0.44 

- 24 

Au-NaF 

0.64 

0.093 

0.91 

Ag-NaCl 

0.63 

0.093 

- 0.8 

Ag-KCl 

0.48 

0.2 

0.24 

Ag-KBr 

0.48 

0.22 

~ 0.1 


From Ref. 5. 



1.5 2.0 2.5 3.0 


10 3 T -1 (K -1 ) 

Figure 5-8. Plot of N/R 2 vs. \/T (left) and t s vs. \/T (right) for Au on KC1. 
(From Ref. 5, with permission from IOP Publishing Ltd.) 

N(0)/R 2 is observed to vary linearly as a function of 1/ T is confirmation that 
Au monomers are stable nuclei on KC1 substrates. 

2. Venables et aL (Ref. 5) have neatly summarized nucleation behavior for 
cases where /* assumes any integer value. In general, the stable cluster density 
is given by 

R P E 

N s = An 0 - tx'p——, (5-37) 

n a v kT 
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Table 5-2. Nucleation Parameters p and E in Eq. 5-37 


Regime 

3D Islands 

2D Islands 

Extreme incomplete 

P = (2/3)/* 

/* 


E = (2/3)[ + (/* + l)£ d „ - E s ] 

E t . + (/* + 1 )E des - E s 

Initially incomplete 

P = 2/*/5 

i*/2 


E = (2/5 )(£,.. + /*£ d „) 

(1/2KE;. + /*£ d ,J 

Complete 

p = /*/(/* + 2.5) 

i*/(>* + 2) 


_ E >- + i*E s 

Ej* + / /"s 


i* + 2.5 

/* + 2 


From Ref. 5. 


where A is a calculable dimensionless constant dependent on the substrate 
coverage. Parameters p and E depend on the condensation regime and are 
summarized in Table 5-2. Three regimes of condensation and two types of 
island nuclei are considered. The complete and extreme incomplete conden¬ 
sation categories parallel those considered previously, but intermediate incom¬ 
plete condensation regimes may also be imagined, depending on deposition 
conditions. As a result of such generalized equations, experimental data for N s 
have been tested as a function of the substrate temperature and deposition rate, 
and values for the energies of desorption, diffusion, and cluster binding have 
been extracted from them. The Walton et al . nucleation theory is seen to be a 
special case of the more general rate theory. For extremely incomplete 
condensation of 2-D islands, the p and E parameters are seen to vary in the 
same way as Eq. 5-27. 

Although the major application of the kinetic model has been to island 
growth, the theory is also capable of describing S.K. growth. 


5 - 4 - Cluster Coalescence and Depletion 


The results of the kinetic theories of nucleation indicate that in the initial stages 
of growth the density of stable nuclei increases with time up to some maximum 
level before decreasing. In this section, the coalescence processes that are 
operative beyond the cluster saturation regime are examined. Coalescence of 
nuclei is generally characterized by the following features: 

1. A decrease in the total projected area of nuclei on the substrate occurs. 

2. There is an increase in the height of the surviving clusters. 

3. Nuclei with well-defined crystallographic facets sometimes become rounded. 
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4. The composite island generally reassumes a crystallographic shape with 
time. 

5. When two islands of very different orientation coalesce, the final compound 
cluster assumes the crystallographic orientation of the larger island. 

6. The coalescence process frequently appears to be liquidlike in nature with 
islands merging and undergoing shape changes after the fashion of liquid 
droplet motion. 

7. Prior to impact and union, clusters have been observed to migrate over the 
substrate surface in a process described as cluster-mobility coalescence. 

Several mass-transport mechanisms have been proposed to account for these 
coalescence phenomena, and these are discussed in turn. 


5.4.1. Ostwald Ripening (Ref. 2) 

Prior to coalescence there is a collection of islands of varied size, and with 
time the larger ones grow or “ripen” at the expense of the smaller ones. The 
time evolution of the distribution of island sizes has been considered both from 
a macroscopic surface diffusion-interface transfer viewpoint as well as from 
statistical models involving single atom processes. The former is simply driven 
by a desire to minimize the surface free energy of the island structure. To 
understand the process, consider two isolated islands of different size in close 
proximity, as shown in Fig. 5-9a. For simplicity they are assumed to be 
spherical with radii r, and r 2 . The surface free energy per unit area of a given 


MASS 



b. 
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2 . 


3. 



Figure 5-9. Coalescence of islands due to (a) Ostwald ripening, (b) sintering, (c) 
cluster migration. 
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island is y, so the total energy G y = 4 irrfy. The island contains a number of 
atoms /?, given by 4?r/f/3fl, where Q is the atomic volume. Defining the free 
energy per atom /*, as dG y /dn i in this application, we have, after substitu¬ 
tion, 

Sir rtf dr { 2Qy 

4 7r rj dr t / Q r i 

Making use of Eq. 1-9 (^, = fi 0 + kT In a t ), we see that the Gibbs-Thomson 
relation 

a , 2Qy 2Qy 

kT In— = - or a = tf^exp- (5-38) 

tfoo /*/ r ,kT 

directly follows. This equation states that atoms in an island of radius r i can be 
in equilibrium only with a substrate adatom activity or effective concentration 
a r The quantity a may be interpreted as the adatom concentration in 
equilibrium with a planar island (r, = 00) or, alternatively, with the vapor 
pressure of island atoms at temperature 7\ When the island surface is convex 
(r, is positive), atoms have a greater tendency to escape, compared with atoms 
situated on a planar surface, because there are relatively fewer atomic bonds to 
attach to. Therefore, a t > a^. Conversely, at a concave island surface, r i is 
negative and a „ > a r These simple ideas have significant implications not 
only with respect to Ostwald ripening but to the sintering mechanisms of 
coalescence that are treated in the next section. 

The establishment of the concentration gradient of adatoms situated between 
the two particles of Fig. 5-9a can now be understood. Diffusion of individual 
adatoms will proceed from the smaller to larger island until the former 
disappears entirely. A mechanism has thus been established for coalescence 
without the islands having to be in direct contact. In a multi-island array the 
kinetic details are complicated, but ripening serves to establish a quasi-steady- 
state island size distribution that changes with time. Ostwald ripening processes 
never reach equilibrium during film growth since the theoretically predicted 
narrow distribution of crystallite sizes is generally not observed. 

5.4.2. Sintering (Ref. 9) 

Sintering is a coalescence mechanism involving islands in contact. It can be 
understood by referring to Fig. 5-10, depicting a time sequence of coalescence 
events between Au particles deposited on molybdenite (MoS 2 ) at 400 °C and 
photographed within the transmission electron microscope (TEM). Within 
tenths of a second a neck forms between islands and then successively thickens 
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b c 



d e f 

Figure 5-10. Successive electron micrographs of Au deposited on molybdenite at 
400 °C illustrating island coalescence by sintering (a) arbitrary zero time, (b) .06 sec, 
(c) 0.18 sec, (d) 0.50 sec, (e) 1.06 sec, (0 6.18 sec. (From Ref. 10). 


as atoms are transported into the region. The driving force for neck growth is 
simply the natural tendency to reduce the total surface energy (or area) of the 
system. Since atoms on the convex island surfaces have a greater activity than 
atoms situated in the concave neck, an effective concentration gradient between 
these regions develops. This results in the observed mass transport into the 
neck. Variations in island surface curvature also give rise to local concentra¬ 
tion differences that are alleviated by mass flow. Of the several mechanisms 
available for mass transport, the two most likely ones involve self-diffusion 
through the bulk or via the surface of the islands. In the case of sintering or 
coalescence of two equal spheres of radius r (Fig. 9b), theoretical calculations 
in the metallurgical literature have shown that the sintering kinetics is given by 
x n /r m = A(T)t y where x is the neck radius, A(T) is a temperature-depen¬ 
dent constant, and n and m are constants. Explicit expressions for the bulk 
and surface diffusion mechanisms are 



l0irD L yUt 

kT 


(5-39) 
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and 


28 £> s 7 (Q) 4/3 / 

kT 


(540) 


respectively, and D L and D s are the lattice and surface diffusion coefficients, 
respectively. In principle, experimental determination of m and n and the 
activation energy for diffusion would serve to pinpoint the transport mecha¬ 
nism, but insufficient data have precluded such an analysis. A simple relative 
comparison between the two mechanisms can, however, be made to order to 
predict which dominates. The ratio of the times required to reach a neck radius 
x = O.lr, for example, is given by 


_ irr 

T l ~ 280 D s Q l/3 ’ 


(5-41) 


In the case of Au at 400 °C, the ratio of D L / D s can be directly read off Fig. 
5-6. At the value T u /7 = 1336/673 = 1.99, D L /D S ~ 10“ 13 /10~ 6 = 
10 -7 . Substituting Q 1 / 3 = 2.57 X 10 -8 cm and r - 10 -5 cm, t s /t L = 4.4 
X 10 “ 7 . Therefore, surface diffusion is expected to control sintering coales¬ 
cence. This is also true for any plausible combination of r and T values in 
films. 

Surface energy and diffusion-controlled mass-transport mechanisms un¬ 
doubtedly influence liquidlike coalescence phenomena involving islands in 
contact, yet other driving forces are probably also operative. For example, 
sintering mechanisms are unable to explain 


1. Observed liquidlike coalescence of metals on substrates maintained at 77 K 
where atomic diffusion is expected to be negligible 

2. Widely varying stabilities of irregularly shaped necks, channels, and islands 
possessing high curvatures at some points 

3. A large range of times required to fill visually similar necks and channels 

4. An observed enhanced coalescence in the presence of an applied electric 
field in the substrate plane 


5.4.3. Cluster Migration (Ref. 2) 

The last mechanism for coalescence considered deals with migration of clusters 
on the substrate surface (Fig. 9-c). Coalescence occurs as a result of collisions 
between separate islandlike crystallites (or droplets) as they execute random 
motion. Evidence provided by the field ion microscope, which has the capabil¬ 
ity of resolving individual atoms, has revealed the migration of dimer and 
trimer clusters. Electron microscopy has shown that crystallites with diameters 



218 


Film Formation and Structure 


of up to 50-100 A can migrate as distinct entities, provided the substrate 
temperature is high enough. Interestingly, the mobility of metal particles can 
be significantly altered in different gas ambients. Not only do the clusters 
translate but they have been observed to rotate as well as even jump upon each 
other and sometimes reseparate thereafter! Cluster migration has been directly 
observed in many systems, e.g., Ag and Au on MoS 2 , Au and Pd on MgO, 
Ag and Pt on graphite in so-called conservative systems, i.e., where the mass 
of the deposit remains constant because further deposition from the vapor has 
ceased. Observations of coalescence in a conservative system include a de¬ 
creased density of particles, increased mean volume of particles, a particle size 
distribution that increases in breadth, and a decreased coverage of the sub¬ 
strate. 

The surface migration of a cap-shaped cluster with projected radius r is 
characterized by an effective diffusion coefficient D{r) with units of cm 2 /sec. 
Presently there exist several formulas for the dependence of D on r based on 
models assumed for cluster migration. The movement of peripheral cluster 
atoms, the fluctuations of areas and surface energies on different faces of 
equilibrium-shaped crystallites, and the glide of crystallite clusters aided by 
dislocation motion are three such models. In each case, D{r) is given by an 
expression of the form (Ref. 11) 

BIT) E c 

D ( r ) = —p — ex P “ ( 5 - 42 ) 

where B(T) is a temperature-dependent constant and s is a number ranging 
from 1 to 3. It comes as no surprise that cluster migration is thermally 
activated with an energy E c related to that for surface self-diffusion, and that 
it is more rapid the smaller the cluster. However, there is a lack of relevant 
experimental data that can distinguish among the mechanisms. In fact, it is 
difficult to distinguish cluster mobility coalescence from Ostwald ripening 
based on observed particle size distributions. 

The interesting effect an applied electric field has in enhancing coalescence 
is worthy of brief comment. Chopra (Ref. 12) has explained the effect on the 
basis of the interaction of the field with electrically charged islands. The 
assumed island charge is derived from ionized vapor atoms and/or the 
potential at the substrate interface. For a spherical particle of radius /*, which 
already possesses a surface free energy, the presence of a charge q contributes 
additional electrostatic energy (i.e., q 2 /r). The increase in total energy is 
accommodated by an increase in surface area. Therefore, the sphere distorts 
into a flattened oblate spheroid, the exact shape being determined by the 
balance of various free energies. The net effect of charging is then to promote 
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further coalescence by ripening, sintering, or cluster mobility processes. 
However, with greater charge or higher fields, the cluster may break up in 
much the same way that a charged droplet of mercury does. 


5 . 5 . Experimental Studies of Nucleation and Growth 


A full complement of microscopic and surface analytical techniques has been 
employed to reveal the physical processes of nucleation and test the theories 
used to describe them. In this section we focus on just a few of the more 
important experimental techniques and the results of some studies. 


5.5.1. Structural Characterization 

The most widely used tool, particularly for island growth studies, is the 
conventional TEM. The technique consists of depositing metals such as Ag and 
Au onto cleaved alkali halide single crystals (e.g., LiF, NaCl, and KC1) in an 
ultrahigh-vacuum system for a given time at fixed R and T. Then the deposit 
is covered with carbon, and the substrate is dissolved away outside the 
system, leaving a rigid carbon replica that retains the metal clusters in their 
original crystal orientation. To enhance the very early stages of nucleation, the 
technique of decoration is practiced. Existing clusters are decorated with a 
lower-melting-point metal, such as Zn or Cd, that does not condense in the 
absence of prior noble metal deposition. This technique renders visible other¬ 
wise invisible clusters that may contain as few as two atoms, thus making 
possible more precise comparisons with theory. A disadvantage of these 
postmortem step-by-step observations is that much useful information on the 
structure and dynamical behavior of the intermediate stages of nucleation and 
growth is lost. For this reason, in situ techniques within both scanning (SEM) 
and transmission electron microscopes, modified to contain deposition sources 
and heated substrates, have been developed but at the expense of decreased 
resolution. The contamination of substrates from hydrocarbons present in the 
specimen chamber is a general problem in such studies necessitating the use of 
high-vacuum, oil-less pumping methods. In addition to direct imaging of film 
nucleation and growth, the electron-diffraction capability of the TEM provides 
additional information on the crystallography and orientation of deposits. 
Diffraction halos and continuous, as well as spotty, diffraction rings character¬ 
ize stages of growth where clusters, which are initially randomly oriented, 
begin to acquire some preferred orientation as shown in Fig. 5-1. When a 
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continuous epitaxial single-crystal film eventually develops, then individual 
diffraction spots appear. 

5.5.2. Auger Electron Spectroscopy (AES) 

The AES technique is based on the measurement of the energy and intensity of 

o 

the Auger electron signal emitted from atoms located within some 5-15 A of a 
surface excited by a beam of incident electrons. The subject of AES will be 
treated in more detail in Chapter 6, but here it is sufficient to note that the 
Auger electron energies are specific to, or characteristic of, the atoms emitting 
them and thus serve to fingerprint them. The magnitude of the AES signal is 
related directly to the abundance of the atoms in question. Consider now the 
deposit substrate combinations corresponding to the three growth modes. If the 
AES signal from the film surface of each is continuously monitored during 
deposition at a constant rate, it will have the coverage or time dependence 
shown schematically in Fig. 5-11, assuming a sticking coefficient of unity 
(Ref. 13). 

In the case of island growth, the signal from the deposit atoms builds slowly 
while that from the substrate atoms correspondingly falls. For S.K. growth the 
signal is ideally characterized by an initial linear increase up to one monolayer 
or sometimes a few monolayers. Then there is a sharp break, and the Auger 
amplitude rises slowly as islands, covering a relatively small part of the 
substrate, are formed. The interpretation of the AES signal in the case of layer 
growth is more complicated. During growth of the initial monolayer, the 
Auger signal is proportional to the deposition rate and sticking coefficient of 
adatoms as well as to the sensitivity in detecting specific elements. For the 
second and succeeding monolayers, the sticking coefficients change. This gives 
rise to slight deviations in slope of the AES signal each time a complete 
monolayer is deposited, and the overall response is therefore segmented as 
indicated in Fig. 5-1 lb. It would be misleading to suggest that all AES data fit 
one of the categories in Fig. 5-11. Complications in interpretation arise from 
atomic contamination, diffusion, and alloying between deposit and substrate, 
and transitions between two- and three-dimensional growth processes. 

5.5.3. Some Results for Metal Films 

Studies of the nucleation and growth of metals, especially the noble ones, on 
assorted substrates have long provided a base for understanding epitaxy and 
film formation processes. An appreciation of the scope of past as well as 
present research activity on metal-substrate systems can be gained by referring 
to Table 5-3. Only epitaxial Au film-substrate combinations are entered in this 
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Table 5-3. Substrates on Which Epitaxial Gold Deposits Have Been Observed 


1. Au-Metal Halides 


CaF 2 

(2) 

KJ 

(10) 

Nal 

(0 

Cdl 2 

(0 

LiF 

( 7 ) 

PbCl 2 (1) 

KBr 

(10) 

NaBr 

(2) 

RbBr (1) 

KC1 

(35) 

NaCl 

(131) 

Rbl 

(1) 

KF 

(2) 

NaF 

(2) 



Au-Metals 





Ag 

(42) 

Fe 

( 3 ) 

Pt 

( 5 ) 

A1 

(0 

Mo 

(6) 

W 

(10) 

Cr 

(2) 

Ni 

(2) 

Zn 

(1) 

Cu 

( 17 ) 

Pb 

(2) 



Cu 3 Au 

(0 

Pd 

(17) 



Au-Selected Semiconductors and Chalcogenides 

C (graphite) 

( 5 ) 

MoS 2 

(20) 

ZnS 

Ge 


(2) 

PbS 

(6) 


Si 


(16) 

PbSe 

( 3 ) 


GaSb 


( 3 ) 

PbTe 

(2) 


GaAs 


( 5 ) 

SnTe 

0) 



4. Au-Carbonates, Oxides, Mica 


CaC0 3 (3) MgO (20) Mica (23) 

AI 2 0 3 (sapphire) (2) Si0 2 (quartz) (4) 

BaTi0 3 (2) ZnO (2) 


Numbers of references dealing with particular Au-substrate system are in parentheses. 
From Ref. 14. 


recent compilation (Ref. 14), and yet they correspond to over 300 references to 
the research literature. Other epitaxial metal film-substrate systems have been 
comprehensively tabulated (Ref. 15) together with deposition methods and 
variables. The sheer numbers and varieties of metals and substrates involved 
point to the fact that epitaxy is a common phenomenon. In the overwhelming 
number of studies, island growth is involved. Perusal of Table 5-3 reveals that 
epitaxial Au films can be deposited on metallic, covalent, and ionic substrates. 
Although the majority of substrate materials listed have cubic crystal struc¬ 
tures, this is not an essential requirement for epitaxy that occurs, for example, 
on hexagonal close-packed Zn as well as on monoclinic mica. That epitaxy is 
possible between materials of different chemical bonding and crystal structure 
means that its origins are not simple. The long-held belief that a small 
difference in lattice constant between film and substrate is essential for epitaxy 
is mistaken; small lattice mismatch is neither a necessary nor sufficient 
condition for epitaxy. The lattice parameter of the metal can either be larger or 
smaller than that of the substrate. Having said this, it is also true that the defect 
density in these metal film “island” epitaxial systems is very much larger than 
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in the “planar” epitaxial semiconductor systems discussed in Chapter 7. Very 
close lattice matching is maintained in the planar epitaxial systems. 

The following specific findings briefly characterize the numerous studies of 
epitaxy of metal films on ionic substrates (Ref. 12). 

L Substrate . The FCC metals generally grow with parallel orientations on 
(100), (110), and (111) surfaces of NaCl, but with the (111) plane parallel to 
the (100) mica cleavage plane. Complex relative positioning of atoms due to 
translational, and more frequently rotational movements, appears to be the 
significant variable in epitaxy rather than lattice parameter differences. 

2. Temperature . High substrate temperatures facilitate epitaxy by (a) low¬ 
ering supersaturation levels, (b) stimulating desorption of impurities, (c) 
enhancing surface diffusion of adatoms into equilibrium sites, and (d) promot¬ 
ing island coalescence. The concept of an epitaxial temperature T E has been 
advanced for alkali halide substrates. Temperature T E depends on the nature of 
the substrate as well as the deposition rate. For example, T E for Ag on LiF, 
NaCl, KC1, and K1 was determined to be 340 °C, 150 °C, 130 °C and 80 °C, 
respectively. The progressive decrease in T E correlates with increases in lattice 
parameter and enhanced ionic (both positive and negative) polarizabilities. The 
latter facilitate attractive forces between metal and substrate atoms. 

3. Deposition Rate . In general, low deposition rates, R, foster epitaxy. It 
has been established that epitaxy occurs when R < const e~ E ^ kT£ . This in¬ 
equality is satisfied physically when the rate at which adatoms settle into 
equilibrium sites exceeds the rate at which adatoms collide with each other. 
Such an interpretation requires that E be a surface diffusion activation energy 
rather than ^des + E 2 in Eq. 5-28b. The reader should compare this criterion 
for T e with those proposed earlier. 

4 . Contamination . The effect of contamination is a source of controversy. 
It has been reported that epitaxy of FCC metals is more difficult on ultrahigh- 
vacuum-cleaved alkali halide substrates than on air-cleaved crystals. Appar¬ 
ently air contamination increases the density of initial nuclei inducing earlier 
coalescence. 


5 . 6 . Grain Structure of Films and Coatings 


5.6.1. Zone Models for Evaporated and Sputtered Coatings 

Until now the chapter has largely focused on the early stages of the formation 
of both polycrystalline and single-crystal films. In this section the leap is made 
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to the regime of the fully developed grain structure of thick polycrystalline 
films and coatings. As we have seen, condensation from the vapor involves 
incident atoms becoming bonded adatoms, which then diffuse over the film 
surface until they desorb or, more commonly, are trapped at low-energy lattice 
sites. Finally, incorporated atoms reach their equilibrium positions in the 
lattice by bulk diffusive motion. This atomic odyssey involves four basic 
processes: shadowing, surface diffusion, bulk diffusion, and desorption. The 
last three are quantified by the characteristic diffusion and sublimation activa- 


l SOLATED ZONE 1 
GROWTH DEFECTS 


SHADOWING 

PROCESSES 


SURFACE 
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Figure 5-12. Schematic representation showing the superposition of physical pro¬ 
cesses which establish structural zones. (Reprinted with permission from Ref. 17, © 


1977 Annual Reviews Inc.). 
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tion energies whose magnitudes scale directly with the melting point T m of the 
condensate. Shadowing is a phenomenon arising from the geometric constraint 
imposed by the roughness of the growing film and the line-of-sight impinge¬ 
ment of arriving atoms. The dominance of one or more of these four processes 
as a function of substrate temperature T s is manifested by different structural 
morphologies. This is the basis of the zone structure models that have been 
developed to characterize film and coating grain structures. 

The earliest of the zone models was proposed by Movchan and Demchishin 
(Ref. 16), based on observations of very thick evaporated coatings (0.3 to 2 
mm) of metals (Ti, Ni, W, Fe) and oxides (Zr0 2 and A1 2 0 3 ) at rates ranging 
from 12,000 to 18,000 A/min. The structures were identified as belonging to 
one of three zones (1, 2, 3). A similar zone scheme was introduced by 
Thornton (Ref. 17) for sputtered metal deposits, but with four zones (1, T, 2, 
3). His model is based on structures developed in 20- to 250-/xm-thick 
magnetron sputtered coatings deposited at rates ranging from 50 to 20,000 
A/min. The exploded view of Fig. 5-12 illustrates the effect of the individual 
physical processes on structure and how they depend on substrate temperature 


Table 5-4. Zone Structures in Thick Evaporated and Sputtered Coatings 




Structural 

Film 

Zone 

T s /T m 

Characteristics 

Properties 

1 (E) 

< 0.3 

Tapered crystals, dome 

High dislocation 



tops, voided boundaries. 

density, hard. 

1 (S) 

< 0.1 at 0.15 Pa 

Voided boundaries, 

Hard. 


to < 0.5 at 4 Pa 

fibrous grains. Zone 1 is 
promoted by substrate 
roughness and oblique 
deposition. 


T (S) 

0.1 to 0.4 at 

Fibrous grains, dense 

High dislocation 


0.15 Pa, - 0.4 to 

grain boundary arrays. 

density, hard. 


0.5 at 4 Pa 


high strength, low 
ductility. 

2(E) 

0.3 to 0.5 

Columnar grains, dense 
grain boundaries. 

Hard, low ductility. 

2(S) 

0.4 to 0.7 



3(E) 

0.5-1.0 

Large equiaxed grains, 

Low dislocation 



bright surface. 

density, soft 

3 (S) 

0.6-1.0 


recrystallized 

grains. 


Note: (E) refers to evaporated. (S) refers to sputtered. 
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and inert sputtering gas (Ar) pressure. A comparison between zone structures 
and properties for evaporated and sputtered coatings is made in Table 5-4. In 
general, analogous structures evolve at somewhat lower temperatures in evapo¬ 
rated films than in sputtered films. Zone 1 structures, which appear in 
amorphous as well as crystalline deposits, are the result of shadowing effects 
that overcome limited adatom surface diffusion. In the zone 2 regime, struc¬ 
tures are the result of surface diffusion-controlled growth. Lattice diffusion 
dominates at the highest substrate temperatures, giving rise to the equiaxed 
recrystallized grains of zone 3. 

In contrast to metals, ceramic materials tend to have low hardness at low 
values of indicating that their strength is adversely affected by lattice 

and grain-boundary imperfections. Ceramics also become harder, not softer, in 
zones 2 and 3. 

5.6.2. Zone Model for Evaporated Metal Films 

In a recent study (Ref. 18), a zone model for thin evaporated metal films 1000 

o 

A thick has been developed. The results for 10 elemental films are shown in 
Fig. 5-13a, where the maximum and minimum grain size variation with T s is 
shown. For T s /T M <0.2 (T m /T s > 5), the grains are equiaxed with a 
diameter of less than 200 A. Within the range 0.2 < T S /T M < 0.3, some 

o 

grains larger than 500 A appear surrounded by smaller grains. Columnar 
grains make their appearance at t s /t m >0 .37, and still higher temperatures 
promote lateral growth with grain sizes larger than the film thickness as shown 
schematically in Fig. 5-13b. Although the same zone classification scheme has 
been used for both sputtered and evaporated films, the grain morphology in 
zones 1 and T differ. Zones 1 and T (a transition zone) possess structures 
produced by continued renucleation of grains during deposition and subsequent 
grain growth. The result is the bimodal grain structure of zone T. Zone 2 
structures are the result of granular epitaxy and grain growth. The variation in 
grain structure in zones 1, T, and 2 presumably arises because different grain 
boundaries become mobile at different temperatures. In zone 1, virtually all 
grain boundaries are immobile, whereas in zone 2 they are all mobile. 
Consequently, at higher temperatures the probability of any boundary sweep¬ 
ing across a grain and reacting to form another mobile boundary is increased. 
Coupled with enhanced surface diffusion, a decrease in porosity results in zone 
3. Bulk grain growth and surface recrystallization occur at the highest tempera¬ 
tures with the largest activation energies. This is evident in Fig. 5-13a, which 
shows the steep dependence of grain size with T s for Tm / T s < 3. 
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Figure 5-13. (a) Plot of maximum and minimum grain size variation with homolo¬ 

gous substrate temperature for 10 different evaporated metals, (b) Zone model for 
evaporated metal films. (From Ref. 18). 


5.6.3. Columnar Grain Structure 

The columnar grain structure of thin films has been a subject of interest for 
several decades. This microstructure consisting of a network of low-density 
material surrounding an array of parallel rod-shaped columns of higher density 
has been much studied by transmission and scanning electron microscopy. As 
noted, columnar structures are observed when the mobility of deposited atoms 
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is limited, and therefore their occurrence is ubiquitous. For example, columnar 
grains have been observed in high-melting-point materials (Cr, Be, Si, and 
Ge), in compounds of high binding energy (TiC, TiN, CaF 2 , and PbS), and in 
non-noble metals evaporated in the presence of oxygen (Fe and Fe-Ni). 
Amorphous films of Si, Ge, SiO, and rare earth-transition metal alloys (e.g., 
Gd-Co), whose very existence depends on limited adatom mobility, are 
frequently columnar when deposited at sufficiently low temperature. Inasmuch 
as grain boundaries are axiomatically absent in amorphous films, it is more 
correct to speak of columnar morphology in this case. This columnar 
morphology is frequently made visible by transverse fracture of the film 
because of crack propagation along the weak, low-density intercolumnar 
regions. Magnetic, optical, electrical, mechanical, and surface properties of 
films are affected, sometimes strongly, by columnar structures. In particular, 
the magnetic anisotropy of seemingly isotropic amorphous Gd-Co films is 
apparently due to its columnar structure and interspersed voids. A collection 
of assorted electron micrographs of film and coating columnar structures is 
shown in Fig. 5-14. Particularly noteworthy are the structural similarities 
among varied materials deposited by different processes, suggesting common 
nucleation and growth mechanisms. 

An interesting observation (Ref. 20) on the geometry of columnar grains has 
been formulated into the so-called tangent rule expressed by Eq. 5-43. Careful 
measurements on obliquely evaporated A1 films reveal that the columns are 
oriented toward the vapor source, as shown in the microfractograph of Fig. 
5-15. The angle 0 between the columns and substrate normal is universally 
observed to be somewhat less than the angle a, formed by the source direction 
and substrate normal. An experimental relation connecting values of a and /3, 
obtained by varying the incident vapor angle over a broad range (0 < a < 90°), 
was found to closely approximate 

tan a - 2 tan 0. (5-43) 

The very general occurrence of the columnar morphology implies a simple 
nonspecific origin such as geometric shadowing, which affords an understand¬ 
ing of the main structural features. 

Recently, a closer look has been taken of the detailed microstructure of 
columnar growth in sputtered amorphous Ge and Si, as well as TiB 2 , WO,. 
BN, and SiC thin films (Ref. 21). Interestingly, an evolutionary development 

o 

of columnar grains ranging in size from — 20 to 4000 A occurs. When 
prepared under low adatom mobility conditions ( t s /t m <o .5), three general 
structural units are recognized; nano-, micro-, and macrocolumns together 
with associated nano-, micro-, and macrovoid distributions. A schematic of 
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Figure 5-14. Representative set of cross-sectional transmission electron micrographs 
of thin films illustrating variants of columnar microstructures, (a) acid-plated Cu, (b) 
sputtered Cu, (c) sputtered Co-Cr-Ta alloy, (d) CVD silicon (also Fig. 4-12), (e) 
sputtered W, D = dislocation, T = twin. (Courtesy of D. A. Smith, IBM T. J. Watson 
Research Lab. Reprinted with permission from Trans-Tech Publication, from Ref. 19). 








230 


Film Formation and Structure 



Figure 5-15. Electron micrograph of a replica of a —2 ^m-thick A1 film. Inset shows 
deposition geometry. (From Ref. 20). 

these interrelated, nested columns is shown in Fig. 5-16. It is very likely that 
the columnar grains of zones 1 and T in the Thornton scheme are composed of 
nano- and microcolumns. 

Computer simulations (Ref. 22) have contributed greatly to our understand¬ 
ing of the origin of columnar grain formation and the role played by shadow¬ 
ing. By serially “evaporating” individual hard spheres (atoms) randomly onto 
a growing film at angle a, the structural simulations in Fig. 5-17 were 
obtained. The spheres were allowed to relax following impingement into the 
nearest triangular pocket formed by three previously deposited atoms, thus 
maximizing close atomic packing. The simulation shows that limited atomic 
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mobility during low-temperature deposition reproduces features observed ex¬ 
perimentally. As examples, film density decreases with increasing a, high- 
density columnlike regions appear at angles for which /3 < a, and film 
densification is enhanced at elevated temperatures. Lastly, the column orienta¬ 
tions agree well with the tangent rule. The evolution of voids occurs if those 
atoms exposed to the vapor beam shield or shadow unoccupied sites from 
direct impingement, and if post-impingement atom migration does not succeed 
in filling the voids. This self-shadowing effect is thus more pronounced the 
lower the atomic mobility and extent of lattice relaxation. 

An important consequence of the columnar-void microstructure is the insta¬ 
bility it engenders in optical coatings exposed to humid atmospheres. Under 
typical evaporation conditions (~ 10" 6 torr, T s = 30-300 °C and deposition 
rate of 300-3000 A/s) dielectric films generally develop a zone 2 structure. 
Water from the ambient is then absorbed throughout the film by capillary 
action. The process is largely irreversible and alters optical properties such as 



Figure 5-16. Schematic representation of macro, micro and nano columns for sput¬ 
tered amorphous Ge films. (Courtesy of R. Messier, from Ref. 21). 







Figure 5-17. Computer-simulated microstructure of Ni film during deposition at 


different times for substrate temperatures of (a) 350 K and (b) 420 K. The angle of 


vapor deposition a is 45°. (From Ref. 22). 


index of refraction and absorption coefficient. Moisture-induced degradation 
has plagued optical film development for many years. A promising remedy for 
this problem is ion bombardment, which serves to compact the film structure. 
This approach is discussed further in Chapters 3 and 11. 


5.6.4. Film Density 

A reduced film density relative to the bulk density is not an unexpected 
outcome of the zone structure of films and its associated porosity. Because of 
the causal structure-density and structure-property relationships, density is 
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expected to strongly influence film properties. Indeed we have already alluded 
to the deleterious effect of lowered overall film densities on optical and 
mechanical properties. A similar degradation of film adhesion and chemical 
stability as well as electrical and magnetic properties can also be expected. 
Measurement of film density generally requires a simultaneous determination 
of film mass per unit area and thickness. Among the experimental findings 
related to film density are the following (Ref. 23): 

1. The density of both metal and dielectric films increases with thickness 
and reaches a plateau value that asymptotically approaches that of the bulk 
density. The plateau occurs at different thicknesses, depending on material 
deposition method and conditions. In Al, for example, a density of 2.1 g/cm 3 
at 250 A rises to 2.58 g/cm 3 above 525 °C and then remains fairly constant 
thereafter. As a reference, bulk Al has a density of 2.70 g/cm 3 . The gradient 
in film density is thought to be due to several causes, such as higher crystalline 
disorder, formation of oxides, greater trapping of vacancies and holes, pores 
produced by gas incorporation, and special growth modes that predominate in 
the early stages of film formation. 

2. Metal films tend to be denser than dielectric films because of the larger 
void content in the latter. A quantitative measure of the effect of voids on 
density is the packing factor P, defined as 

volume of solid 

P= -. (5-44) 

total volume of film (solid + voids) 

Typical values of P for metals are greater than 0.95, whereas for fluoride 
films (e.g., MgF 2 , CaF 2 ) P values of approximately 0.7 are realized. 
However, by raising T s for the latter, we can increase P to almost unity. 

3. Thin-film condensation is apparently accompanied by the incorporation 
of large nonequilibrium concentrations of vacancies and micropores. Whereas 
bulk metals may perhaps contain a vacancy concentration of 10~ 3 at the 
melting point, freshly formed thin films can have excess concentrations of 
1(T 2 at room temperature. In addition, microporosity on a scale much finer 
than imagined in zones 1 and T has been detected by TEM phase (defocus) 

o 

contrast techniques (Ref. 24). Voids measuring 10 A in size, present in 
densities of about 10 17 cm’ 3 have been revealed in films prepared by 
evaporation as shown in Fig. 5-18. The small voids appear as white dots 
surrounded by black rings in the underfocused condition. Microporosity is 
evident both at grain boundaries and in the grain interior of metal films. In 
dielectrics a continuous network of microvoids appears to surround grain 
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Figure 5-18. Transmission electron micrograph showing microvoid distribution in 
evaporated Au films. (Courtesy of S. Nakahara, AT&T Bell Laboratories.) 

boundaries. This crack network has also been observed in Si and Ge films, 
where closer examination has revealed that it is composed of interconnecting 
cylindrical voids. Limited surface diffusion, micro-self-shadowing effects, and 
stabilization by adsorbed impurities encourage the formation of microporosity. 
In addition to reducing film density, excess vacancies and microvoids may play 
a role in fostering interdiffusion in thin-film couples where the Kirkendall 
effect has been observed (see Chapter 8). The natural tendency to decrease the 
vacancy concentration through annihilation is manifested by such film changes 
as stress relaxation, surface faceting, adhesion failure, recrystallization and 
grain growth, formation of dislocation loops and stacking faults, and decrease 
in hardness. 


5 . 7 . Amorphous Thin Films 


5.7.1. Systems, Structures, and Transformations 

Amorphous or glassy materials have a structure that exhibits only short-range 
order or regions where a predictable placement of atoms occurs. However, 
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within a very few atom spacings, this order breaks down, and no long-range 
correlation in the geometric positioning of atoms is preserved. Although bulk 
amorphous materials such as silica glasses, slags, and polymers are well 
known, amorphous metals were originally not thought to exist. An interesting 
aspect of thin-film deposition techniques is that they facilitate the formation of 
amorphous metal and semiconductor structures relative to bulk preparation 
methods. 

As noted, production of amorphous films requires very high deposition rates 
and low substrate temperatures. The latter immobilizes or freezes adatoms on 
the substrate where they impinge and prevents them from diffusing and seeking 
out equilibrium lattice sites. By the mid-1950s Buckel (Ref. 25) produced 
amorphous films of pure metals such as Ga and Bi by thermal evaporation onto 
substrates maintained at liquid helium temperatures. Alloy metal films proved 
easier to deposit in amorphous form because each component effectively 
inhibits the atomic mobility of the other. This meant that higher substrate 
temperatures (- 77 K) could be tolerated and that vapor quench rates did not 
have to be as high as those required to produce pure amorphous metal films. 
Although they are virtually impossible to measure, vapor quench rates in 
excess of 10 10 °C/sec have been estimated. From laboratory curiosities, 
amorphous Si, Se, GdCo, and GeSe thin films have been exploited for such 
applications as solar cells, xerography, magnetic bubble memories, and high- 
resolution optical lithography, respectively. 

Important fruits of the early thin-film work were realized in the later 
research and development activities surrounding the synthesis of bulk amor¬ 
phous metals by quenching melts. Today continuously cast ribbon and strip of 
metallic glasses (Metglas) are commercially produced for such applications as 
soft magnetic transformer cores and brazing materials. Cooling rates of - 10 6 
°C/sec are required to prevent appreciable rates of nucleation and growth of 
crystals. Heat transfer limitations restrict the thickness of these metal glasses to 
less than 0.1 mm. In addition to achieving the required quench rates, the alloy 
compositions are critical. Most of the presently known glass-forming binary 
alloys fall into one of four categories (Ref. 26): 

1. Transition metals and 10-30 at% semimetals 

2. Noble metals (Au, Pd, Cu) and semimetals 

3. Early transition metals (Zr, Nb, Ta, Ti) and late transition metals (Fe, Ni, 
Co, Pd) 

4. Alloys consisting of IIA metals (Mg, Ca, Be) 

In common, many of the actual glass compositions correspond to where 
“deep” (low-temperature) eutectics are found on the phase diagram. 

Amorphous thin films of some of these alloys as well as other metal alloys 
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and virtually all elemental and compound semiconductors, semimetals, oxides, 
and chalcogenide (i.e., S-, Se-, Te-containing) glasses have been prepared by a 
variety of techniques. Amorphous Si films, for example, have been deposited 
by evaporation, sputtering, and chemical vapor deposition techniques. In 
addition, large doses of ion-implanted Ar or Si ions will amorphize surface 
layers of crystalline Si. Even during ion implantation of conventional dopants, 
local amorphous regions are created where the Si matrix is sufficiently 
damaged, much to the detriment of device behavior. Lastly, pulsed laser 
surface melting followed by rapid freezing has produced amorphous films in Si 
as well as other materials (see Chapter 13). 

5.7.2. Au - Co and Ni - Zr Amorphous Films 

It is instructive to consider amorphous Co-30Au films since they have been 
well characterized structurally and through resistivity measurements (Ref. 27). 
The films were prepared by evaporation from independently heated Co and Au 
sources onto substrates maintained at 80 K. Dark-field electron microscope 
images and corresponding diffraction patterns are shown side by side in Fig. 
5-19. The as-deposited film is rather featureless with a smooth topography, and 
the broad halos in the diffraction pattern cannot be easily and uniquely assigned 
to the known lattice spacings of the crystalline alloy phases in this system. Both 
pieces of evidence point to the existence of an amorphous phase whose 
structural order does not extend beyond the next-nearest-neighbor distance. 
The question of whether so-called amorphous films are in reality microcrys¬ 
talline is not always easy to resolve. In this case, however, the subsequent 
annealing behavior of these films was quite different from what is expected of 
fine-grained crystalline films. Heating to 470 K resulted in the face-centered 
cubic diffraction pattern of a single metastable phase, whereas at 650 K, lines 
corresponding to the equilibrium Co and Au phases appeared. Resistivity 
changes accompanying the heating of Co-38Au (an alloy similar to Co-30Au) 
revealed a two-step transformation as shown in Fig. 5-20. Beyond 420 K there 
is an irreversible change from the amorphous structure to a metastable FCC 
crystalline phase, which subsequently decomposes into equilibrium phases 
above 550 K. The final two-phase structure is clearly seen in Fig. 5-19. The 
high resistivity of the amorphous films is due to the enhanced electron 
scattering by the disordered solid solution. Crystallization to the FCC structure 
reduces the resistivity, and phase separation, further still. 

Both the amorphous and metastable phases are stable over a limited tempera¬ 
ture range in which the resistivity of each can be cycled reversibly. Once the 
two-phase structure appears, it, of course, can never revert to less thermody- 
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Figure 5-19. Electron micrographs and diffraction patterns of Co-30at%Au: (top) as 
deposited at 80 K, warmed to 300 K (amorphous); (middle) film warmed to 470 K 
(single-phase FCC structure); (bottom) film heated to 650 K (two-phase equilibrium). 
(From Ref. 27). 

namically stable forms. This amorphous-crystalline transformation apparently 
proceeds in a manner first suggested by Ostwald in 1897. According to the 
so-called Ostwald rule, a system undergoing a reaction proceeds from a less 
stable to a final equilibrium state through a succession of intermediate 
metastable states of increasing stability. In this sense, the amorphous phase is 
akin to a quenched liquid phase. Quenched films exhibit other manifestations 
of thermodynamic instability. One is increased atomic solubility in amorphous 
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Figure 5-20. Resistivity of a Co-38at%Au film as a function of annealing tempera¬ 
ture. Reversible values of dpjdT in various structural states of the film are shown 
together with changes in p during phase transformation. (From Ref. 27). 


or single-phase metastable matrices. For example, the equilibrium phase 
diagram for Ag-Cu is that of a simple eutectic with relatively pure terminal 
phases of Ag and Cu that dissolve less than 0.4 at% Cu and 0.1 at% Ag, 
respectively, at room temperature. These limits can be extended to 35 at% on 
both sides by vapor-quenching the alloy vapor. Similar solubility increases 
have been observed in the Cu-Mg, Au-Co, Cu-Fe, Co-Cu, and Au-Si alloy 
systems. 

Confounding the notion that rapid quenching of liquids or vapors is required 
to produce amorphous alloy films is the startling finding that they can also be 
formed by solid-state reaction. Consider Fig. 5-21, which shows the result of 
annealing a bilayer couple consisting of pure polycrystalline Ni and Zr films at 
300 °C for 4 h. The phase diagram predicts negligible mutual solid solubility 
and extensive intermetallic compound formation; surprisingly, an amorphous 
NiZr alloy film is observed to form. Clearly, equilibrium compound phases 
have been bypassed in favor of amorphous phase nucleation and growth, as 
kinetic considerations dominate the transformation. The effect, also observed 
in Rh-Si, Si—Ti, Au-La, and Co-Zr systems, is not well understood. 
Apparently the initial bilayer film passes to the metastable amorphous state via 
a lower energy barrier than that required to nucleate stable crystalline com¬ 
pounds. However, the driving force for either transformation is similar. Unlike 
other amorphous films, extensive interdiffusion can be tolerated in NiZr 
without triggering crystallization. 
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Figure 5-21. Cross-sectional electron micrograph of an amorphous Ni-Zr alloy film 
formed by annealing a crystalline bilayer film of Ni and Zr at 300 °C for 4 hours. 
(Courtesy of K. N. Tu, IBM Corp., T. J. Watson Research Lab., from Ref. 28). 

5.7.3. A Model To Simulate Structural Effects in Thin Films 

One of the outcomes of their research on quenched alloy films was an engaging 
mechanical model Mader and Nowick (Ref. 29) developed to better explain the 
experimental results. Many phenomena observed in pure and alloy thin-film 
structures are qualitatively simulated by this model. For this reason, it is 
valuable as a pedogogic tool and worth presenting here. The “atoms” compos¬ 
ing the thin films were acrylic plastic balls of different sizes. They were rolled 
down a pinball-like runway tilted at 1.5° to the horizontal to simulate the 
random collision of evaporant atoms. A monolayer of these atoms was then 
“deposited” on either an “amorphous” or “crystalline” substrate. The 
former was a flat sheet of plastic, and the latter contained a perfect two-dimen¬ 
sional periodic array of interstices into which atoms could nest. Provision was 
made to alter the alloy composition by varying the ball feed. A magnetic 
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vibrator simulated thermal annealing. To obtain diffraction patterns from the 
arrays, they prepared reduced negatives (with an array size of about 4 mm 
square). The balls appeared transparent on a dark background with a mean ball 
separation of — 0.13 mm. Fraunhofer optical diffraction patterns were gener¬ 
ated by shining light from a He-Ne laser (X = 6328 A) on the negative 
mounted in contact with a 135-mm lens of a 35-mm camera. The resulting 
photographs are reproduced here. 



Figure 5-22. Atomic sphere film structures and corresponding Fraunhofer diffraction 
patterns for (a) perfect array, (b) stacking fault, (c) pure film; low deposition rate, (d) 
pure film; high deposition rate. (Reprinted with permission from the IBM Corp., from 
A. S. Nowick and S. R. Mader, IBM J. Res. Dev . 9, 358, 1965). 
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The perfect array of spheres of one size is shown together with the 
corresponding diffraction pattern in Fig. 5-22a. A hexagonal pattern of sharp 
spots, very reminiscent of electron diffraction patterns of single-crystal films, 
is obtained, reflecting the symmetry of the close-packed array. After creation 
of a stacking fault in the structure, the diffraction pattern shows streaks (Fig. 
5-22b). These run perpendicular to the direction of the fault in the structure. 

The effect of deposition rate is shown in Figs. 5-22c and 5-22d. When the 
film is deposited “slowly,” there are grains, vacancies, and stacking faults 
present in the array. Relative to Fig. 5-22a, the diffraction spots are broad¬ 
ened, a precurser to ring formation. In Fig. 5-22d, the film is deposited at a 
“high” rate and the grain structure is considerably finer and more disordered 
with numerous point defects, voids, and grain boundaries present. Now, 
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semicontinuous diffraction rings appear, which are very much like the common 
X-ray Debye-Scherer rings characteristic of polycrystals. Interestingly, the 
intensity variation around the ring is indicative of preferred orientation. When 
the rapidly deposited films are annealed through vibration, the array densities, 
vacancies are annihilated, faults are eliminated, and grains reorient, coalesce, 
and grow. The larger grains mean a return to the spotted diffraction pattern. 




b 


Figure 5-23. Atomic sphere film structure for concentrated alloy (50A-50B, 27% 
size difference: (a) as-deposited (amorphous); (b) vibration annealed. (Reprinted with 
permission from the IBM Corp., from A. S. Nowick and S. R. Mader, IBM J, Res. 
Dev . 9, 358, 1965). 
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We now turn our attention to alloy films. For “concentrated” alloys 
containing equal numbers of large and small spheres with a size difference of 
27%, the as-deposited structure is amorphous, as indicated in Fig. 5-23.. The 
diffraction pattern contains broad halos. Upon vibration annealing, the film 
densifies slightly, but the atomic logjam cannot be broken up. There is no 
appreciable change in its structure or diffraction pattern—it is still amorphous. 
For less concentrated alloys (~ 17%), however, the as-deposited structure is 
very fine grained but apparently crystalline. 

All of the foregoing results were for films deposited on the smooth sub¬ 
strate. The “crystalline substrate” affords the opportunity to model epitaxy 
phenomena. Pure films deposit in almost perfect alignment with the substrate 
when deposited slowly. Imperfect regions are readily eliminated upon anneal¬ 
ing and nearly perfect single crystals are obtained. Rapidly deposited films are 
less influenced by the underlying substrate and remain polycrystalline after 
annealing. Clearly epitaxial growth is favored by low deposition rates. The 
presence of alloying elements impeded epitaxy from occurring in accord with 
experience. 

The foregoing represents a sampling of the simulations of the dependence of 
film structure on deposition variables. Readers interested in this as well as 
other mechanical models of planar arrays of atoms, such as the celebrated 
Bragg bubble raft model (Ref. 30), should consult the literature on the subject. 
Much insight can be gained from them. 



Exercises 



1- Under the same gas-phase supersaturation, cube-shaped nuclei are ob¬ 
served to form homogeneously in the gas and heterogeneously both on a 
flat substrate surface and at right-angle steps on this surface. For each of 
these three sites calculate the critical nucleus size and energy barrier for 
nucleation. 

2 . A cylindrical pill-like cluster of radius r nucleates on a dislocation that 
emerges from the substrate. The free-energy change per unit thickness is 
given by 

AG = irr 2 AG V + liryr -I- A - B In r, 
where A - B In r represents the dislocation energy within the cluster. 

a. Sketch AG vs. r (note at r = 0, AG = oo). 

b. Determine the value of r*. 
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c. Show that when A G v B/iry 2 > 1/2, AG monotonically decreases 
with r, but when kGyB/iry 2 < 1/2 there is a turnaround in the 
curve. (The latter case corresponds to a metastable state and associ¬ 
ated energy barrier.) 

3- Cap-shaped nuclei on substrates grow both by direct impingement of 
atoms from the vapor phase as well as by attachment of adatoms diffusing 
across the substrate surface. 

a. In qualitative terms how will the ratio of the two mass fluxes depend 
on nucleus size, area density of nuclei, and deposition rate. 

b. Write a quantitative expression for the flux ratio, making any reason¬ 
able assumptions you wish. 

4 . Two spherical nuclei with surface energy 7 having radii r, and r 2 
coalesce in the gas phase to form one spherical nucleus. If mass is 
conserved, calculate the energy reduction in the process. Suppose two 
spherical caps of different radii coalesce on a planar substrate to form 
one cap-shaped nucleus. Calculate the energy reduction. 

5 - Two spherical nuclei of radii r, and r 2 are separated by a distance /. If 
r x > r 2 , derive an expression for the time it will take for the smaller 
nucleus to disappear by sequential atomic dissolution and diffusion to the 
larger nucleus by Ostwald ripening. Assume the diffusivity of atoms on 
the surface is D s . Make simplifying assumptions as you see fit. 

6 - Assume that the two nuclei in Fig. 5-10 coalesce by a sintering mecha¬ 
nism. 

a. By carefully measuring the neck width and plotting it as a function of 
time, determine the value of n in the general sintering kinetics 
formula. 

b. From these data, estimate a value for the approximate diffusivity. 
Assume 7 = 1000 ergs/cm 2 , T = 400 °C, and Q = 17 x 10 ' 24 
cm 3 /atom. 

7 . A Film is deposited on a substrate by means of evaporation. In the 
expression for the rate of heterogeneous nucleation (Eq. 5-17), identity 
which terms are primarily affected by 

a. raising the temperature of the evaporant source. 

b. changing the substrate material. 

c. doubling the source-substrate distance. 



Exercises 


245 


d. raising the substrate temperature. 

e. improving the system vacuum. 

In each case qualitatively describe the nature of the change. 

8 . From data shown in Fig. 5-5 calculate values for E 6es , E s , and E t + . (For 
answers consult Ref. 3, page 8-23.) 

9 . Three different methods for estimating the temperature for epitaxial 
growth of films have been discussed in this chapter. 

a. Comment on the similarities and differences in the respective ap¬ 
proaches. 

b. How well do they predict the experimental findings of Fig. 5-4? 

10 . Derive expressions for the epitaxial transition temperatures 7 ' 1 _» 3 and 

T 2 -3- 

11. During examination of the grain structure of a film evaporated from a 
point source onto a large planar substrate, the following observations 
were made as a function of position: 

1. There is a film thickness variation. 

2. There is a grain size variation. 

3. There is a variation in the angular tilt of columnar grains. 

Explain the physical reasons for these observations. 

12 . The formation of three-dimensional crystallites from an amorphous matrix 
undergoing transformation by nucleation and growth processes follows 
the time (/) dependent kinetics given by 

TrNv 3 t 4 

/(0 = 1 - exp--—. 

N is the nucleation rate of crystallites (per unit volume), v is their growth 
velocity, and / is the fractional extent of transformation. 

a. N is small near the critical transformation temperature and at low 
temperature, but larger in between. Why? 

b. v is usually larger for higher temperatures. Why? 

c. Schematically sketch f(t) vs. t (or In /) at a series of temperatures. 
Note that an incubation time dependent on temperature is suggested. 

13 . a Atoms on either side of a curved grain boundary (GB) reside on 

surfaces of different curvature, establishing a local chemical potential 
gradient that will drive GB migration. Use the Nernst-Einstein equa¬ 
tion to show that the grain size will tend to grow with parabolic 
kinetics. 
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b. Part (a) is valid when the film grain size is smaller than the film 
thickness. Why? If the reverse is true, suggest why parabolic growth 
kinetics may not be observed. 
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Characterization of 
Thin Films 


6 . 1 . Introduction 


Scientific disciplines are identified and differentiated by the experimental 
equipment and measurement techniques they employ. The same is true of 
thin-film science and technology. For the first half of this century, interest in 
thin films centered around optical applications. The role played by films was 
largely a utilitarian one, necessitating measurement of film thickness and 
optical properties. However, with the explosive growth of thin-film utilization 
in microelectronics, there was an important need to understand the intrinsic 
nature of films. With the increasingly interdisciplinary nature of applications, 
new demands for film characterization and other property measurements arose. 
It was this necessity that drove the creativity and inventiveness that culminated 
in the development of an impressive array of commercial analytical instru¬ 
ments. These are now ubiquitous in the thin-film, coating, and broader 
scientific communities. In many instances, it was a question of borrowing and 
modifying existing techniques employed in the study of bulk materials (e.g., 
X-ray diffraction, microscopy, mechanical testing) to thin-film applications. In 
other cases well-known physical phenomena (e.g., electron spectroscopy, 
nuclear scattering, mass spectroscopy) were exploited. A partial list of the 
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Table 6-1. Analytical Techniques 


Primary Beam 

Energy Range 

Secondary Signal 

Acronym 

Electron 

20-200 eV 

Electron 

LEED 


300-30,000 eV 

Electron 

SEM 


1 keV-30 keV 

X-ray 

EMP (EDX) 


500 eV-lOkeV 

Electron 

AES 


100-400 keV 

Electron 

TEM 


100-400 keV 

Electron, X-ray 

STEM 


100-400 keV 

Electron 

EELS 

Ion 

0.5-2.0 keV 

Ion 

ISS 


1-15 keV 

Ion 

SIMS 


1-15 eV 

Atoms 

SNMS 


1 keV and up 

X-ray 

PIXE 


5-20 keV 

Electron 

SIM 


> 1 MeV 

Ion 

RBS 

Photon 

> 1 keV 

X-ray 

XRF 


> 1 keV 

X-ray 

XRD 


> 1 keV 

Electron 

ESCA, XPS 


Laser 

Ions 

— 


Laser 

Light 

LEM 


From Ref. 1. 



Technique Application 

Low-energy electron diffraction Surface structure 

Scanning electron microscopy Surface morphology 

Electron microprobe Surface region composition 

Auger electron spectroscopy Surface layer composition 

Transmission electron microscopy High-resolution structure 

Scanning TEM Imaging, X-ray analysis 

Electron energy loss spectroscopy Local small area composition 
Ion-scattering spectroscopy Surface composition 

Secondary ion mass spectroscopy Trace composition vs. depth 

Secondary neutral mass spectrometery Trace composition vs. depth 
Particle-induced X-ray emission Trace composition 

Scanning ion microscopy Surface characterization 

Rutherford backscattering Composition vs. depth 

X-ray fluorescence Composition (/im depth) 

X-ray diffraction Crystal structure 

X-ray photoelectron spectroscopy Surface composition 

Laser microprobe Composition of irradiated area 

Laser emission microprobe Trace element analysis 
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modern techniques employed in the characterization of electronic thin-film 
materials and devices is given in Table 6-1. Among their characteristics are the 
unprecedented structural resolution and chemical analysis capabilities over 
small lateral and depth dimensions. Some techniques only sense and provide 
information on the first few atom layers of the surface. Others probe more 
deeply, but in no case are depths much beyond a few microns accessible for 
analysis. Virtually all of these techniques require a high or ultrahigh vacuum 
ambient. Some are nondestructive, others are not. In common, they all utilize 
incident electron, ion, or photon beams. These interact with the surface and 
excite it in such a way that some combination of secondary beams of electrons, 
ions, or photons are emitted, carrying off valuable structural and chemical 
information in the process. A rich collection of acronyms has emerged to 
differentiate the various techniques. These abbreviations are now widely 
employed in the thin-film and surface science literature. 

General testing and analysis of thin films is carried out with equipment and 
instruments which are wonderfully diverse in character. For example, consider 
the following extremes in their attributes: 

1 . Size —This varies from a portable desktop interferometer to the 50-ft long 
accelerator and beam line of a Rutherford backscattering (RBS) facility. 

2. Cost —This ranges from the modest cost of test instruments required to 
measure electrical resistance of films to the approximate $1 million price 
tag of a commercial SIMS spectrometer. 

3. Operating Environment —This varies from the ambient in the measure¬ 
ment of film thickness to the 10“ ,0 -torr vacuum required for the measure¬ 
ment of film surface composition. 

4. Sophistication—hi one extreme is the manual scotch-tape film peel test for 
adhesion, and at the other is an assortment of electron microscopes and 
surface analytical equipment where operation and data gathering, analysis, 
and display are essentially computer-controlled. 

What is remarkable is that films can be characterized structurally, chemi¬ 
cally, and with respect to various properties with almost the same ease and 
precision that we associate with bulk measurement. This despite the fact that 
there are many orders of magnitude fewer atoms available in films. To 
appreciate this, consider AES analysis of a Si wafer surface layer containing 1 
at% of an impurity. Only the top 10-15 A is sampled, and since state-of-the-art 
systems have a lateral resolution of 500 A, the total measurement volume 
corresponds to (7 t/4)(500) 2 (15) = 3 X 10 6 A 3 . In Si this corresponds to about 
150,000 matrix atoms, and therefore only 1500 impurity atoms are detected in 
the analysis! Such measurements pose challenges in handling and experimental 
techniques, but the problems are usually not insurmountable. 
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This chapter will only address the experimental techniques and applications 
associated with determination of 

1. Film thickness 

2. Film morphology and structure 

3. Film composition 

These represent the common core of information required of all films and 
coatings irrespective of ultimate application. Within each of these three cate¬ 
gories, only the most important techniques will be discussed. Beyond these 
broad characteristics there are a host of individual properties (e.g., hardness, 
adhesion, stress, electrical conductivity, reflectivity, etc.), that are specific to 
the particular application. The associated measurement techniques will there¬ 
fore be addressed in the appropriate context throughout the book. 


6.2. Film Thickness 


6.2.1. Introduction 

The thickness of a film is among the first quoted attributes of its nature. The 
reason is that thin-film properties and behavior depend on thickness. Histori¬ 
cally, the use of films in optical applications spurred the development of 
techniques capable of measuring film thicknesses with high accuracy. In 
contrast, other important film attributes, such as structure and chemical 
composition, were only characterized in the most rudimentary way until 
relatively recently. In some applications, the actual film thickness, within 
broad limits, is not particularly crucial to function. Decorative, metallurgical, 
and protective films and coatings are examples where this is so. On the other 
hand, microelectronic applications generally require the maintenance of precise 
and reproducible film thicknesses as well as lateral dimensions. Even more 
stringent thickness requirements must be adhered to in optical applications, 
particularly in multilayer coatings. 

The varied types of films and their uses have generated a multitude of ways 
to measure film thickness. A list of methods mentioned in this chapter is given 
in Table 6-2 together with typical measurement ranges and accuracies. In¬ 
cluded are destructive and nondestructive methods. The overwhelming major¬ 
ity are applicable to films that have been prepared and removed from the 
deposition chamber. Only a few are suitable for real-time monitoring of film 
thickness during growth. We start with optical techniques, a subject that 
is covered extensively in virtually every book and reference on thin films 
(Refs. 2-4). 
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Table 6-2. A Summary of Selected Film Thickness Measurement Techniques 


Method 

Range 

Accuracy or 
Precision 

Comments 

Multiple-beam 

FET 

30-20,000 A 

10-30 A 

A step and reflective 
coating required 

Multiple-Beam 

FECO 

10-20,000 A 

2 A 

A step, reflective 
coating, and spectrometer 
required; accurate but 
time-consuming 

VAMFO 

800 A-10 jim 

0.02-0.05 % 

For transparent Films 
on reflective substrates; 
Nondestructive 

CARIS 

400 A-20 fj ,m 

10 A-0.1% 

For transparent Films; 
Nondestructive 

Step gauge 

500-15,000 A 

- 200 A 

Values for Si0 2 on Si 

Ellipsometry 

A few A to 
a few jim 

1 A 

Transparent Films; complicated 
mathematical analysis 

Stylus 

20 A to 

A few A to 

Step required; simple and 


no limit 

< 3% 

rapid 

Weight 

< 1 A to 


Accuracy depends on 

measurement 

no limit 


knowledge of Film density 

Crystal 

< 1 A to 

< 1 A to 

Nonlinear behavior at 

oscillator 

a few jim 

a few % 

larger Film thicknesses 


6.2.2. Optical Methods for Measuring Film Thickness 

Optical techniques for film thickness determinations are widely used for a 
number of reasons. They are applicable to both opaque and transparent films, 
yielding thickness values of generally high accuracy. In addition, measure¬ 
ments are quickly performed, frequently nondestructive, and utilize relatively 
inexpensive equipment. The single basic principle which most optical tech¬ 
niques rely on is the interference of two or more beams of light whose optical 
path difference is related to film thickness. The details of instrumentation 
differ, depending on whether opaque or transparent films are involved. 


6.2.2.I. Interferometry of Opaque Films 

Fringes of Equal Thickness (FET). For opaque films a sharp step down to 
a substrate plane must be first generated either during deposition through a 
mask or by subsequent etching. A neighboring pair of light rays reflected from 
the film-substrate will travel different lengths and interfere by an amount 
dependent on the step height. To capitalize on the effect, one uses, multiple- 
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Figure 6-1. (a) Schematic view of experimental arrangement required to produce 

multiple-beam Fizeau fringes, (b) Fringe displacement at step. 

beam interferometry, a technique developed by Tolansky (Ref. 5). This 
requires that the optical reflectance of both the film and substrate be very high 
as well as uniform. This is accomplished by evaporating a metal such as A1 or, 
better yet, Ag over both film and substrate. Interference fringes are generated 
by placing a highly reflective, but semitransparent, optically flat reference 
plate very close to the film-step-substrate region as shown in Fig. 6-la. The 
two highly reflective surfaces are tilted slightly off parallel, enabling the light 
beam to be reflected in a zig-zag fashion between them many times. A series of 
increasingly attentuated beams are now available for interference. This sharp¬ 
ens the resultant, so-called Fizeau fringes of equal thickness which can be 
viewed with a microscope. 

The condition for constructive interference is that the optical path difference 
between successive beams be an integral number of wavelengths or 

2 6 \ 

25 + - = n X. (6-1) 

27r 

Here 5 is the distance between film and flat, X is the wavelength of the 
monochromatic radiation employed, and n is an integer. The phase change 
accompanying reflection b is assumed to be the same at both surfaces and is 
taken to be ir because of the high film reflectances. Therefore, 


2 


X, 


( 6 - 2 ) 
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and the distance between maxima of successive fringes corresponds to 5 = X/2. 
The existence of the step now displaces the fringe pattern abruptly by an 
amount A proportional to the film thickness d. As indicated in Fig. 6-lb, the 
film thickness is given by 

A X 

d - -. 

fringe spacing 2 

For highly reflective surfaces, the fringe width is about 1 /40 of the fringe 
spacing. Displacements of about 1/5 of a fringe width can be detected. For the 
Hg green line (X = 5640 A) the resolution is therefore (l/40)(l/5)(5640/2) » 
14 A. The resolution and ease of measurement are, respectively, influenced by 
the fraction of incident light reflected (/?) and the fraction absorbed ( A) by the 
film overlying the step. Raising R from 0.9 to 0.95 reduces the fringe width 
by half, whereas high A values reduce the fringe intensity. 

Fringes of Equal Chromatic Order (FECO). Film preparation for mea¬ 
surement is identical to that for FET. Now, however, white , rather than 
monochromatic light, is employed, and the reflected light is spectrally analyzed 
by a spectrometer. Equation 6-1 still applies, but X is no longer single-valued; 
i.e., 

25 = n\ = (n + 1)X2 = = (* + /)X /+1 . (6-4) 

Adjacent lines correspond to different X’s and different orders, where n + / is 
the chromatic order of a given line. When a step of height d is present, then 

2(5 + d) - 25 = 2d = n AX, (6-5) 

where AX is the measurable wavelength shift of a fringe due to the resulting 
interference. To obtain the film thickness, we must know the order n. By Eq. 
6-4, n = X2 /(Xj - X^, where X 2 corresponds to the shorter wavelength. For 
fringes corresponding to X, and X 2 , 

Xa AX 

By analogy to Eq. 6-3 and Fig. 6-lb, it may be useful to think of X, - X 2 as 
the fringe spacing, and AX as the fringe displacement, but with distances 
measured in an optical spectrometer rather than in a microscope. In general, 
the FECO technique is capable of higher accuracy than FET, especially for 

o 

films that are very thin. The maximum resolution is about ±5 A but to attain 
this, precise positioning of the reference plate to align fringes is essential. 


( 6 - 6 ) 


(6-3) 
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6.2.2.2. Interferometry of Transparent Films . A perfectly suitable 
method for measuring the thickness of transparent films is to first generate a 
step, metalize the film-substrate, and then proceed with either the FET or 
FECO techniques previously discussed. However, transparent films are ideally 
suited for interferometiy because interference of light occurs naturally between 
beams reflected from the two film surfaces. This means that a step is no longer 
required. Since different interfaces (the air-film and film-substrate) are 
involved for the beams that interfere, precautions must be taken to account for 
phase changes on reflection. This subject is discussed at length in Chapter 11. 
The relevant Fig. 6-2 summarizes what happens when monochromatic light of 
wavelength X is normally incident on a transparent film-substrate combina¬ 
tion. If n x and n 2 are the respective film and substrate indices of refraction, 
the intensity of the reflected light undergoes oscillations as a function of the 
optical film thickness, or n x d. When n x > n 2 , then maxima occur at film 
thicknesses equal to 

X 3X 5X 
4 n x ’ Ariy ’ Ariy 




OPTICAL THICKNESS / WAVELENGTH 

Figure 6-2, Calculated variation of reflectance (on air side) with normalized thick¬ 
ness (rt,flf/X) for films of various refractive indices on a glass substrate of index 1.5, In 
Chapter 11 there is a fuller discussion of this figure. (From K. L. Chopra, Thin Film 
Phenomena , 1969, reproduced with permission from McGraw-Hill, Inc.). 
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Figure 6-3. Schematic of experimental arrangement in the VAMFO technique. (From 
W. A. Pliskin & S. J. Zanin in Handbook of Thin Film Technology , edited by L. I. 
Maissel and R. Glang, © 1970, with permission from McGraw-Hill Inc.). 

For values of d halfway between these, the reflected intensity is minimum. 
When n x < n 2 , a reversal in intensity occurs at the same optical film thick¬ 
ness. In Fig. 6-2 these results are shown for the case of a glass substrate 
(n 2 = 1.5) and for films in which n x is either greater or less than 1.5. In order 
to exploit these concepts for the measurement of film thickness, we must 
devise experimental arrangements so that the intensity oscillations can be 
revealed. 

VAMFO (Ref. 2). In the VAMFO (variable angle monochromatic fringe 
observation) method provision is made to vary the angle of incidence(Z) of 
light on the film as shown in Fig. 6-3. In this arrangement a monochromatic 
light filter is employed to select a single wavelength for detection; monochro¬ 
matic light can also be used. As the stage and sample are rotated, maxima 
(bright) and minima (dark) fringes are observed on the film surface. For a 
transparent film on an absorbing reflecting substrate, the film thickness is 
simply given by 

d = {N\co$e)/2n x . (6-8) 

Here, 0 is the angle of refraction in the film, and N is the fringe order, 
which is measured by counting successive minima starting from perpendicular 
incidence. More accurate d values are obtained when the more easily detected 
intensity minima are measured. Then N assumes half-integer values 1/2, 3/2, 
5/2, etc. The technique has the advantage of not requiring an optically flat 
substrate or a collimated light source. A disadvantage is that the refractive 
index of the film must be known at the wavelength of measurement. Other¬ 
wise, values must be assumed, and a series of successive approximations made 
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until predicted and measured intensity minima angles coincide. Corrections due 
to phase changes on reflection at the substrate must also be made. For further 
details, the reader is referred to the literature (Ref. 2) where applications to 
bilayer transparent films are also discussed. 

CARIS (Ref. 2). In the technique known as CARIS (constant-angle reflection 
interference spectroscopy), the wavelength of the incident light rather than the 
angle of observation is systematically varied. The radiation is reflected from 
the film into a spectrometer with fringes being formed as a function of 
wavelength. For homogeneous films the thickness is determined by 

AN f \\2 

^ - X^cos 0 ’ 

where AN f is the number of fringes between wavelengths of \ and X^ In 
applying Eq. 6-9, it is important to realize that n x varies with X. The 
dispersion is greatest in the ultraviolet for most materials leading to large 
errors in this region. As with VAMFO, /i, values, if unknown, can be initially 
assumed and then determined through successive approximations, taking into 
account phase changes accompanying reflection at interfaces. Electronic detec¬ 
tion methods extend the capability of CARIS to measurement of thickness in 
semiconductor films that are transparent in the infrared. Bilayer film thick¬ 
nesses can also be determined through complex analysis (Ref. 2). 

Step Gauges. If there is a particular need to frequendy measure the 
thickness of one kind of film, it may be advantageous to construct a step 
gauge. Films of different but independendy known thickness are deposited on 
the substrate of interest and are arrayed sequentially. Interference colors 
observed when the specimen film is examined in reflected light are matched to 
the color of the step gauge standard. For example, a step gauge for Si0 2 films 
on a Si wafer has proven to be useful in estimating film thickness to 
approximately 200 A. A simple way to prepare such a gauge is to etch a thick 
Si0 2 film in the shape of a wedge by slowly lifting the wafer from the etchant 
(dilute HF) in which it is immersed, at a constant velocity. 

Ellfpsometry (Refs. 2, 6). Also known as polarimetry and polarization 
spectroscopy, the technique of ellipsometry is a century old and has been used 
to obtain the thickness and optical constants of films. The method consists of 
measuring and interpreting the change of polarization state that occurs when a 
polarized light beam is reflected at non-normal incidence from a film surface. 
Shown in Figure 6-4 is the experimental arrangement for ellipsometer mea- 
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SUBSTRATE 



Figure 6-4. Experimental arrangement in ellipsometry. (From W. A. Pliskin & S. J. 
Zanin in Handbook of Thin Film Technology , edited by L. I. Maissel and R. Glang, 
© 1970, with permission from McGraw-Hill Inc.). 

surements. The light source is first made monochromatic, collimated, and then 
linearly polarized. Upon passing through the compensator (usually a quarter- 
wave plate), the light is circularly polarized and then impinges on the specimen 
surface. After reflection, the light is transmitted through a second polarizer 
that serves as the analyzer. Finally, the light intensity is judged by eye or 
measured quantitatively by a photomultiplier detector. The polarizer and 
analyzer are rotated until light extinction occurs. The extinction readings 
enable the phase difference ( A e ) and amplitude ratio (tan 0) of the two 
components of reflected light to be determined. From these, either the Film 
thickness or the index of refraction can be obtained. 

Space limitations do not allow for discussion of these relationships, but an 
appreciation of what is involved can be gained by referring to Chapter 11. 
There, only light normally incident on the Film-substrate is considered, and, 
therefore, the distinction between parallel and perpendicular polarizations 
vanishes. But for light incident at an angle the two components of reflectivity, 
r 11 and r 1 , are distinct and the ratio of their amplitudes is given by 

r 11 / r x = tan \pe ,Ae . (6-10) 

This fundamental equation of ellipsometry relates the Film and substrate indices 
of refraction, film thickness, and phase changes during reflection at the Film 
interfaces. Computer programs and graphical solutions exist to enable un¬ 
known n and d values to be extracted (Ref. 2). Provision can be made to 
accommodate partially absorbing Films and substrates. 

There has long been interest in ellipsometry as a tool in the semiconductor 
industry to monitor the in situ deposition and growth of thin Films on 
well-polished, reflecting substrates, such as Si and GaAs. Because of its 
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Figure 6-5. Ellipsometry measurement of (GaAJ)As transition in the same 
organometallic CVD system, (a) Experimental trajectory of A e and and comparison 
with theoretical growth model (solid line), with crosses corresponding to 100-A 
increments in thickness, (b) Refractive index (rr) and absorption coefficient ( k ) as a 
function of GaAs film thickness. (Reproduced with permission from Ref. 7, © 1987, 
Annual Reviews Inc.). 
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sensitivity in the submonolayer range and ability to function at high tempera¬ 
tures or pressures, through gases and liquids, etc., ellipsometry has been 
employed in plasma and CVD reactors, MBE equipment, and electrochemical 
cells. Advances in real-time data acquisition and reduction have enabled the 
measurement and control of thin-film growth rates. 

An interesting example of ellipsometry techniques in monitoring real-time 
epitaxial growth in the GaAs-GaAlAs system is shown in Fig. 6-5. The 
experiment was carried out in a CVD reactor at 600 °C containing flows of 
AsH 3 , (CH 3 ) 3 Ga, and (CH 3 ) 3 A1 gases past a GaAlAs film, illuminated by a 
He-Ne laser (6328 A) at a 71° angle of incidence. After the A1 species flow 
was stopped, growth of the GaAs film commenced. The resulting spiral 
trajectory of A e and \p with film thickness (or time) is shown together with 
changes in the optical constants of GaAs. Within 150 A the GaAlAs -► GaAs 
growth transition is apparently complete. The ability of ellipsometry to pre¬ 
cisely monitor optical property changes in very thin films makes it attractive in 
multilayer film growth and etching studies. 

6.2.3. Mechanical Techniques for Measuring Film Thickness 

6.2.3.7. Stylus-Method Profilometry . The stylus method consists of mea¬ 
suring the mechanical movement of a stylus as it is made to trace the 
topography of a film-substrate step. A diamond needle stylus with a tip radius 
of — 10 fxm serves as the electromagnetic pickup. The stylus force is 
adjustable from 1 to 30 mg, and vertical magnifications of a few thousand up 
to a million times are possible. Film thickness is directly read out as the height 
of the resulting step-contour trace. Several factors that limit the accuracy of 
stylus measurements are 

1. Stylus penetration and scratching of films. This is sometimes a problem 
in very soft films (e.g., In, Sn). 

2. Substrate roughness . This introduces excessive noise into the measure¬ 
ment, which creates uncertainty in the position of the step. 

3. Vibration of the equipment . Proper shock mounting and rigid supports 
are essential to minimize background vibrations. 

In modern instruments the leveling and measurement functions are com¬ 
puter-controlled. The vertical stylus movement is digitized, and the data can be 
processed to magnify areas of interest and yield best profile fits. Calibration 
profiles are available for standardization of measurements. The measurement 
range spans distances from 200 A to 65 and the vertical resolution is 
* 10 A. 
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Figure 6-6. Profilometer trace of crater in InP produced during raster scanning of 
12.5-keV Cs + ion beam across surface. (Courtesy of H. Luftman, AT&T Bell 
Laboratories.) 

One of the important applications of stylus measurements is to determine the 
flatness and depth of a sputtered crater during depth-profiling analysis by AES 
or SIMS. In this technique a circular region of the film surface is sputtered 
away and an electron (AES) or ion (SIMS) probe beam ideally analyzes the flat 
bottom of the crater formed. Such a crater profile generated during SIMS 
analysis is shown in Fig. 6-6, where the total depth sputtered exceeds 2.5 pm. 
Since the sputtering times are known, this information can be converted to an 
equivalent depth scale for use in determination of precise concentration pro¬ 
files. If the crater walls are slanted rather than vertical, the analyzing beam 
may not sample a well-defined flat surface but some portion of the sidewall. 
This leads to errors in the concentration depth profile that should be corrected 
for. 
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6.2.3.2. Weight Measurement Measurement of the weight of the film 
deposit appears, at first glance, to be an easy direct way to determine film 
thickness d. Knowing the film mass m, the deposit area A , and film density 
Py y we have 

d = m / Ap f . (6-11) 


This simple method has been often used in ill-equipped laboratories where 
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precision mass balances are more common than interferometers or stylus 
instruments. Values of d so obtained are imprecise because the film density is 
not known with certainty. The reason is that the film packing factor P, a 
measure of the void content, can be quite low; e.g., P » 0.75 for porous 
deposits. If handbook bulk values of p are used in Eq. 6-11, d would, of 
course, be underestimated. Furthermore, in cases where the substrate contains 
a great deal of relief in the form of roughness, cleavage steps, patterned 
topography, etc., the effective deposit area will be larger than the assumed 
projected area. In this case, the film thickness may be overestimated. For 
ultrathin films possessing an island structure, this method, as well as others 
noted previously, are problematical. 

Even though gravimetric techniques have disadvantages, very delicate and 
novel microbalances have been constructed and widely employed to monitor 
film thickness during deposition. Microbalance designs have relied on such 
principles as the elongation of a thin quartz-fiber helix, the torsion of a wire, 
or the deflection of a pivot-mounted beam. Sensitive optical and, more 
commonly, electromechanical transducers and compensators for null measure¬ 
ments have been employed, enabling detection of — 10” 8 g. By utilizing very 
light, large-area substrates, we can measure deposits fractions of a monolayer 

o 

thick. Typical equivalent film thicknesses of less than 10 A for low-density 

o 

materials (e.g., Si0 2 ) and 1 A for high-density metals (e.g., Pt) are detectable. 
Microbalances made almost entirely of quartz can be degassed at elevated 
temperatures, making them suitable for ultrahigh-vacuum operation. The most 
important gravimetric technique involves the use of quartz crystal oscillators. It 
is to this almost universally employed technique for in situ monitoring of the 
thickness of physical vapor-deposited films that we now turn our attention. 

6.2.3.3. Crystal Oscillators (Refs. 4, 8). Homogeneous elastic plates set 
into mechanical vibration have resonant frequencies that depend on their 
dimensions, elastic moduli, and, importantly, density. Additional mass in the 
form of a deposited thin film alters (lowers) the resonant frequency by 
effectively changing the properties of the composite vibrating plate. This is the 
principle that underlies the use of crystal oscillators to measure film thickness. 
In this method an AT quartz crystal, i.e., cut — 35° with respect to the c axis, 
containing metal film electrodes on both wide faces is mounted within the 
deposition chamber close to the substrate. The fundamental frequency / of the 
shear mode is given by 

f=v q /2d q , (6-12) 

where v q is the elastic wave velocity and d q is the plate thickness; d q is also 
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equal to half the wavelength of the transverse wave. If mass dm deposits on 
one of the crystal electrodes, the thickness increases by an amount given by 
Eq. 6-11. Combination of these two equations yields a frequency change given 
by 


df= - 


f 2 dm 
Cp f A 


(6-13) 


where C = v q /2 is defined as the frequency constant whose value is 1656 
kHz-mm in AT cut quartz. In deriving Eq. 6-13, we have assumed that the 
addition of a small foreign film mass can be treated as an equivalent mass 
change of the quartz crystal. The formula is not rigorously correct because the 
elastic properties of the film are not the same as those of quartz, and A is 
generally not equal to the total crystal face area. These effects greatly 
complicate the frequency-response analysis. Nevertheless, as long as the 
accumulated mass deposited on the crystal does not shift the resonant fre¬ 
quency by a few percent of its original value, df varies linearly with dm. 

To appreciate the kind of numbers involved, we note that a 6-MHz crystal is 
commonly employed. Since a frequency shift of 1 Hz is readily measurable, 
Eq. 6-13 reveals that this is equivalent to a mass of A1 = 1.24 X 10 -8 g, and 
if A = 1 cm 2 , to a film thickness of 0.46 A. This sensitivity is suitable for 
most applications. It can be enhanced, however, by more than an order of 
magnitude by employing thinner crystals with higher resonant frequencies and 
by detecting smaller frequency shifts. 

The change in frequency is usually measured by beating the crystal signal 
against that from a reference (undeposited) crystal and counting the frequency 
difference. Quartz crystal oscillators are commonly employed for the measure¬ 
ment of deposition rate rather than film thickness. Therefore, commercial rate 
monitors contain circuitry to mathematically differentiate the frequency change 
with respect to time, display the rate, and provide feedback to control the 
power delivered to evaporation heaters. In these functions it is essential to 
eliminate uncertainty in the frequency shift measurement. A potentially impor¬ 
tant source of error arises from the temperature increase of the crystal due to 
radiant heat exposure from the evaporant source, and from the heat of 
condensation liberated by depositing atoms. Typically, temperature increases 
of a few degrees Celsius above that of the reference crystal result in frequency 
shifts of 10-100 Hz that are equivalent to a mass change of 10 -7 to 10 -6 
g/cm 2 (Ref. 8). For this reason, crystals are enclosed in water-cooled shrouds 
having a small entrance aperture to sample the evaporant stream. Lastly, 
precise work requires a correction due to the geometry of the monitor relative 
to the substrate. 
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6 . 3 . Structural Characterization 


6.3.1. Introduction 

Several levels of structural information are of interest to the thin-film scientist 
and technologist in research, process development, and reliability and failure 
analysis activities. The first broadly deals with the geometry of patterned films 
where issues of lateral or depth dimensions and tolerances, uniformity of 
thickness and coverage, completeness of etching, etc,, are of concern. Beyond 
this, the film surface topography and morphology, including grain size and 
shape, existence of compounds, presence of hillocks or whiskers, evidence of 
film voids, microcracking or lack of adhesion, formation of textured surfaces, 
etc., are of concern. Somewhat more difficult to obtain, but crucial to 
microelectronic device fabrication and optical coating technology, are the 
cross-sectional views of multilayer structures where interfacial regions, sub¬ 
strate interactions, and geometry and perfection of electronic devices with 
associated conducting and insulating layers may be observed. 

Lastly, and most complex of all, are diffraction patterns, the crystallo¬ 
graphic information they convey, and the high-resolution lattice images of both 
plain-view and transverse film sections. Among the applications here are defect 
structures in films and devices, structure of grain boundaries, identification of 
phases, and a host of issues related to epitaxial structures—e.g., the crystallo¬ 
graphic orientations involved, direct imaging of atoms at interfaces, interfacial 
quality and defects, perfection of quantum well and strained-layer superlat¬ 
tices. The transmission electron microscope (TEM) is required for these 
applications, whereas those of the previous paragraph are normally addressed 
by the scanning electron microscope (SEM) and, occasionally, by the reflec¬ 
tion metallurgical microscope. There is an interesting distinction between the 
TEM and SEM. The former is a true microscope in that all image information 
is acquired simultaneously or in parallel. In the SEM, however, only a small 
portion of the total image is probed at any instant, and the image builds up 
serially by scanning the probe. Strictly speaking, the SEM is more like the 
scanning Auger electron and SIMS microprobes than a traditional microscope. 
In this section we treat only electron microscopy, a subject dealt with at length 
in the recommended references (Refs. 9, 10). We start with the SEM. 

6.3.2. Scanning Electron Microscopy 

Because seeing is believing and understanding, the SEM is perhaps the most 
widely employed thin-film and coating characterization instrument. A schematic 
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a 



ELECTRON ENERGY (eV) 


Figure 6-8. (a) Electron and photon signals emanating from tear-shaped interaction 

volume during electron-beam impingement on specimen surface, (b) Energy spectrum 
of electrons emitted from specimen surface, (c) Effect of surface topography on 
electron emission. 


other atomic electrons and to the lattice. Through continuous random scattering 
events, the primary beam effectively spreads and fills a teardrop-shaped 
interaction volume (Fig. 6-8a) with a multitude of electronic excitations. The 
result is a distribution of electrons that manage to leave the specimen with an 
energy spectrum shown schematically in Fig. 6-8b. In addition, target X-rays 
are emitted, and other signals such as light, heat, and specimen current are 
produced, and the sources of their origin can be imaged with appropriate 
detectors. 

The various SEM techniques are differentiated on the basis of what is 
subsequently detected and imaged. 


6.3.2. 1. Secondary Electrons . The most common imaging mode relies on 
detection of this very lowest portion of the emitted energy distribution. Their 
very low energy means they originate from a subsurface depth of no larger 
than several angstroms. The signal is captured by a detector consisting of a 
scintillator-photomultiplier combination, and the output serves to modulate the 
intensity of a CRT, which is rastered in synchronism with the raster-scanned 
primary beam. The image magnification is then simply the ratio of scan lengths 
on the CRT to that on the specimen. Resolution specifications quoted on 
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research quality SEMs are — 50 A. Great depth of focus enables images of 
beautiful three-dimensional quality to be obtained from nonplanar surfaces. 
The contrast variation observed can be understood with reference to Fig. 6-8c. 
Sloping surfaces produce a greater secondary electron yield because the 
portion of the interaction volume projected on the emission region is larger 
than on a flat surface. Similarly, edges will appear even brighter. Many 
examples of secondary electron SEM images have been reproduced in various 
places throughout the book. 


6.3.2.2. Backscatiered Electrons . Backscattered electrons are the high- 
energy electrons that are elastically scattered and essentially possess the same 
energy as the incident electrons. The probability of backscattering increases 
with the atomic number Z of the sample material. Since the backscattered 
fraction is not a very strong function of Z (varying very roughly as - 0.05Z 1/2 
for primary electron beams employed in the SEM), elemental identification is 



Figure 6-9. Secondary electron image and corresponding electron channeling pat¬ 
terns from (100) Si layers epitaxially regrown from the melt in the LEGO process 
(Section 7.5.2). The blurred channeling pattern coincides with zone of melt impinge¬ 
ment where solidification defects (dislocations) form. (Unpublished research—D. 
Schwarcz and M. Ohring.) 
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not feasible from such information. Nevertheless, useful contrast can develop 
between regions of the specimen that differ widely in Z. Since the escape 
depth for high-energy backscattered electrons is much greater than for low-en¬ 
ergy secondaries, there is much less topological contrast in the images. 

An interesting phenomenon that makes use of backscattered electrons is 
electron channeling. In a single-crystal specimen (e.g., an epitaxial film), some 
of the crystal planes are oriented properly for electron diffraction to occur. To 
exploit the effect, the incident beam is made to rock about the normal to the 
surface over a range of Bragg angles. Electrons channel between lattice planes 
and are scattered in the forward and reverse directions. These latter electrons 
are intercepted by a broad-area detector, producing electron channeling pat¬ 
terns. An epitaxial Si layer regrown from melted polysilicon gave rise to the 
characteristic channeling patterns in Fig. 6-9. Where regrowth is less perfect 
due to the presence of defects, the pattern is somewhat blurred. The technique 
is difficult to quantify but is capable of nondestructively probing the crys¬ 
tallinity of regions that are several microns in extent. 

6.3.2.3. Electron-Beam-Induced Current (EBIC). The EBIC mode is ap¬ 
plicable to semiconductor devices. When the primary electron beam strikes the 
surface, electron-hole pairs are generated and the resulting current is collected 
to modulate the intensity of the CRT image. The technique is useful in spatially 
locating subsurface defects and failure sites within a junction region. 

6.3.2.4. X-Rays . A SEM is like a large X-ray vacuum tube used in conven¬ 
tional X-ray diffraction systems. Electrons emitted from the filament (cathode) 
are accelerated to high energies where they strike the specimen target (anode). 
In the process, X-rays characteristic of atoms in the irradiated area are emitted. 
By an analysis of their energies, the atoms can be identified and by a count of 
the numbers of X-rays emitted the concentration of atoms in the specimen can 
be determined. This important technique, known as X-ray energy dispersive 
analysis (EDX), is discussed in more detail in Section 6.4.3. 


6.3.3. Transmission Electron Microscopy 

As the name implies, the transmission electron microscope is used to obtain 
structural information from specimens that are thin enough to transmit elec¬ 
trons. Thin films are, therefore, ideal for study, but they must be removed 
from electron-impenetrable substrates prior to insertion in the TEM. The two 
basic modes of TEM operation are differentiated by the schematic ray dia- 
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BACK FOCAL PLANE OF OBJECTIVE LENS 
(OBJECTIVE APERTURE 0.5 - 20 jim) 

FIRST INTERMEDIATE IMAGE PLANE 
(INTERMEDIATE APERTURE 5 - 50 p m) 
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SECOND INTERMEDIATE IMAGE PLANE 
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VIEWING SCREEN 


IMAGE DIFFRACTION 

MODE MODE 

Figure 6-10. Ray paths in the TEM under imaging and diffraction conditions. 
(Reprinted with permission from John Wiley and Sons, from G. Thomas and M. J. 
Goringe, Transmission Electron Microscopy of Materials, Copyright © 1979, John 
Wiley and Sons). 


grams of Fig. 6-10. Electrons thermionically emitted from the gun are acceler¬ 
ated to 100 keV or higher (1 MeV in some microscopes) and first projected 
onto the specimen by means of the condenser lens system. The scattering 
processes experienced by electrons during their passage through the specimen 
determine the kind of information obtained. Elastic scattering, involving no 
energy loss when electrons interact with the potential field of the ion cores, 
gives rise to diffraction patterns. Inelastic interactions between beam and 
matrix electrons at heterogeneities such as grain boundaries, dislocations, 
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second-phase particles, defects, density variations, etc., cause complex absorp¬ 
tion and scattering effects, leading to a spatial variation in the intensity of the 
transmitted beam. The generation of characteristic X-rays and Auger electrons 
also occurs, but these by-products are not usually collected. 

The emergent primary and diffracted electron beams are now made to pass 
through a series of post-specimen lenses. The objective lens produces the first 
image of the object and is, therefore, required to be the most perfect of the 
lenses. Depending on how the beams reaching the back focal plane of the 
objective lens are subsequently processed distinguishes the operational modes. 
Basically, either magnified images are formed or diffraction patterns are 
obtained as shown in Fig. 6-10. A discussion of the analysis of diffraction 
effects is well beyond the scope of this book. Some notion of the correlation 
between structure and diffraction patterns can be gained from the model of thin 
films presented in Section 5.7.3. 

Images can be formed in a number of ways. The bright-field image is 
obtained by intentionally excluding all diffracted beams and only allowing the 
central beam through. This is done by placing suitably sized apertures in the 
back focal plane of the objective lens. Intermediate and projection lenses then 
magnify this central beam. Dark-field images are also formed by magnifying a 
single beam; this time one of the diffracted beams is chosen by means of an 
aperture that blocks the central beam and the other diffracted beams. The 

o 

micrograph of alternating 40 A-wide films in the GaAs-Al 0 5 Ga 0 5 As superlat¬ 
tice structure shown in Fig. 6-11 is a dark-field image employing the 200 
diffracted beam. In both of these cases we speak of amplitude contrast because 
diffracted beams with their phase relationships are excluded from the imaging 
sequence. In a third method of imaging, the primary transmitted and one or 
more of the diffracted beams are made to recombine, thus preserving both their 
amplitudes and phases. This is the technique employed in high-resolution 
lattice imaging, enabling diffracting planes and arrays of individual atoms to be 
distinguished. The interface image of the epitaxial CoSi 2 film on Si (Fig. 1-4) 
is an example of this remarkable technique. Other examples of epitaxial film 
interfaces are shown in Figs. 7-16 and 14-17. 

The high magnification of all TEM methods is a result of the small effective 
wavelengths (X) employed. According to the de Broglie relationship, 

\=h/s/2mqV, (6-14) 

where m and q are the electron mass and charge, h is Planck’s constant, and 
V is the potential difference through which electrons are accelerated. Electrons 
of 100 keV energy have wavelengths of 0.037 A and are capable of effectively 
transmitting through about 0.6 fim of Si. 
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Figure 6-11- Dark-field TEM image of alternating 40-A-wide GaAs-Ga 0 5 Al 0 5 As 
superlattice films. Light bands contain Al. (Courtesy of S. Nakahara, AT&T Bell 
Laboratories. 


The ability to prepare thin vertical sections of integrated circuits for TEM 
observation is one of the most important recent advances in technique. If one 
can imagine the plane of this page to be the surface thinned for conventional 
TEM work, transverse imaging requires head-on thinning and viewing of the 
- 75-^m-thick page edge. What is involved is the transverse cleavage of a 
number of wafer specimens, bonding these in an epoxy button, and thinning 
them by mechanical grinding and polishing. Finally, the resulting thin disk is 
mounted in an ion-milling machine where the specimen is further sputter 
thinned by ion bombardment until a hole appears. In VLSI applications, many 
specimens must be simultaneously mounted to enhance the probability of 
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Figure 6-12. A cross-section TEM bright-field image of field effect transistor struc¬ 
ture after junction delineation. Junctions were formed by As implantation and diffusion. 
(Courtesy of R. B. Marcus, Bellcore Corp., Reprinted with permission from John 
Wiley and Sons, from R. B. Marcus and T. T. Sheng, Transmission Electron 
Microscopy of Silicon VLSI Circuits and Structures , Copyright © 1983, John Wiley 
and Sons). 

capturing images from desired circuit features. An example of the vertical 
section of a field effect transistor is shown in Fig. 6-12 (Ref. 11). 


6.3.4. X-Ray Diffraction 

X-ray diffraction is a very important experimental technique that has long been 
used to address all issues related to the crystal structure of bulk solids, 
including lattice constants and geometry, identification of unknown materials, 
orientation of single crystals, and preferred orientation of polycrystals, defects, 
stresses, etc. Extension of X-ray diffraction methods to thin films has not been 
pursued with vigor for two main reasons: First the great penetrating power of 
X-rays means that with typical incident angles, their path length through films 
is too short to produce diffracted beams of sufficient intensity. Under such 
conditions the substrate, rather than the film, dominates the scattered X-ray 
signal; thus, diffraction peaks from films require long counting times. Second, 
the TEM provides similar diffraction information with the added capability of 
performing analysis over very small selected areas. Nevertheless, X-ray meth¬ 
ods have advantages because they are nondestructive and do not require 
elaborate sample preparation or film removal from the substrate. 
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Figure 6-13. (a) Seeman-Bohlin diffraction geometry: / = effective location of 

X-ray source, 7 = angle of incidence, 0 = Bragg angle, (b) Schematic of layered Film 
specimen, (c) A selected portion of the diffraction pattern with reflections from Ag, Pb, 
and Pb 2 Au shown. (Courtesy of K. N. Tu, IBM T. J. Watson Research Laboratory, 
from Ref. 12). 

What is required for a workable X-ray method is to make the film appear to 
be thicker to the beam than it actually is. This can be done by employing a 
grazing angle of incidence 7 as shown in Fig. 6-13. Thus if 7 = 5°, the film 
is effectively 12 times thicker. The Seeman-Bohlin diffraction geometry is 
employed with the focal point of the X-ray source, film specimen, and detector 
slit all located on the circumference of one great circle. Each of the diffracted 
peaks (at different angles) are sequentially swept through as the X-ray detector 
moves along the circumference. All the while, a servomotor rotates the 
detector to keep it aimed at the specimen and preserve the overall focusing 
geometry. 

An example of the diffraction pattern obtained is shown in Fig. 6-13c. The 
specimen consists of consecutively evaporated polycrystalline films of Ag, Au, 
Ag, and Pb on fused quartz. After the composite structure was annealed at 200 
°C for 24 h, Pb 2 Au peaks emerged, indicating that Au atoms diffused through 
the Ag layer and then reacted with the Pb. Grain boundaries in Ag were the 
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likely diffusion pathways because no penetration of single-crystal Ag films by 
Au was observed. 


6 . 4 . Chemical Characterization 


6.4.1. Introduction 

We now focus on chemical characterization of thin films. This includes 
identification of surface and interior atoms and compounds, as well as their 
lateral and depth spatial distributions. To meet these needs, we use an 
important subset of the analytical techniques listed in Table 6-1. Space 
limitation will restrict the discussion to include only the most popular methods 
(EDX, AES, XPS, RBS, and SIMS) and variants based on these. The justifica¬ 
tion for selecting these and not others is that they, together with the SEM 
and TEM, form the core of the diagnostic facilities associated with all phases 
of the research, development, processing, reliability, and failure analysis of 
thin-film electronic devices and integrated circuits. In VLSI technology 
some of these methods have gained wide acceptance as support tools for 
manufacturing lines. In addition, all of the associated equipment for these 
techniques is now commercially available, albeit at high cost. All excellent film 
characterization laboratories are outfitted with the total complement of this 
equipment. 

Table 6-3 will assist the reader to distinguish among the various chemical 
analytical methods. The capabilities and limitations of each are indicated, and 
the comparative strengths and weaknesses for particular analytical applications 
can, therefore, be assessed. The following remarks summarize several of these 
distinctions: 

1. AES, XPS, and SIMS are true surface analytical techniques, since the 
detected electrons and ions are emitted from surface layers less than - 15 

o 

A deep. Provision is made to probe deeper, or depth profile, by sputter¬ 
etching the film and continuously analyzing the newly exposed surfaces. 

2. EDX and RBS generally sample the total thickness of the thin film (- 1 
fi m) and frequently some portion of the substrate as well. Unlike RBS with 
a depth resolution of — 200 A, EDX has little depth resolution capability. 

3. AES, XPS, and SIMS are broadly applicable to detecting, with few 
exceptions, all of the elements in the periodic table. 

4. EDX can ordinarily only detect elements with Z > 11, and RBS is re¬ 
stricted to only selected combinations of elements whose spectra do not 
overlap. 
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Table 6-3. Summary of Major Chemical Characterization Techniques 




Detection 




Elemental 

Limit 

Lateral 

Effective 

Method 

Sensitivity 

(at%) 

Resolution 

Probe Depth 

Scanning Electron 

Na-U 

~ 0.1 

- 1 /Am 

~ 1 /tm 

microscope - energy 
dispersive x-ray 
(SEM/EDX) 

Auger Electron 

spectroscopy 

(AES) 

Li-U 

~ 0.1-1 

500 A 

- 15 A 

X-Ray 

Photoelectron 

Li-U 

- 0.1-1 

~ 100 /zm 

- 15 A 

spectroscopy 

(XPS) 

Rutherford 

Backscattering 

(RBS) 

He-U 

- 1 

1 mm 

— 200 A 

Secondary 

H-U 

1 

o 

1 

- 1 /Am 

15 A 

ion mass 





spectrometry 

(SIMS) 






5. The detection limits for AES, XPS, EDX, and RBS are similar, ranging 
from about — 0.1 to 1 at%. On the other hand, the sensitivity of SIMS is 
much higher and parts per million can be detected. Even lower concentra¬ 
tion levels (~ 10 -6 at%) are detectable in certain instances. 

6. Quantitative chemical analysis with AES and XPS is problematical with 
composition error bounds of several atomic percent. EDX is better and 
SIMS significantly worse in this regard. Composition standards are essen¬ 
tial for quantitative SIMS analysis. 

7. Only RBS is quantitatively precise to within an atomic percent or so from 
first principles and without the use of composition standards. It is the only 
nondestructive technique that provides simultaneous depth and composition 
information. 

8. The lateral spatial resolution of the region over which analyses can be 
performed is highest for AES ( — 500 A) and poorest for RBS (~ 1 mm). 
In between are EDX (~ 1 ^m), SIMS (several /*m), and XPS (—0.1 mm). 
AES has the distinction of being able to sample the smallest volume for 
analysis. 
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9. Only XPS, and to a much lesser extent AES, are capable of readily 
providing information on the nature of chemical bonding and valence states. 

The preceding characteristics earmark certain instruments for specific tasks. 
Suppose, for example, a film surface is locally discolored due to contamina¬ 
tion, or contains a residue, and it is desired to identify the source of the 
unknown impurities. Assuming access to all instruments at equal cost, AES 
and EDX would be the techniques of choice. If only ultrathin surface layers are 
involved, EDX would probably be of little value. The presence of trace 
elements would necessitate the higher sensitivity of SIMS analysis. If prelimi¬ 
nary examination pointed to the presence of Cl from an etching process, then 
evidence of the actual chemical compound formed would be obtained from 
XPS measurements. In a second example, a broad-area, thin-film metal bilayer 
structure is heated. Here we know which elements are initially present, but 
wish to determine the stoichiometry of intermetallic compounds formed as well 
as their thicknesses. This information is without question most unambiguously 
provided by RBS methods. 

In what follows, the various techniques are considered individually where 
additional details of instrumentation, aspects of particular capabilities and 
limitations, and applications will be presented. First, however, it is essential to 
appreciate the scientific principles underlying each type of analysis. More 
detailed discussions of these characterization techniques are given in Refs. 
13-19. 


6.4.2. Electron Spectroscopy 

We start with a discussion of atomic core electron spectroscopy since it is the 
basis for identification of the elements by EDX, AES, and XPS techniques. 
Consider the electronic structure of an unexcited atom schematically depicted 
in Fig. 6-14a. Both the K, L, M, etc., shell notation and the corresponding Is, 
2s, 2p, 3s, etc., electron states are indicated. Through excitation by an incident 
electron or photon, a hole or electron vacancy is created in the K shell (Fig. 
6-14b). 

In EDX an electron from an outer shell lowers its energy by filling the hole, 
and an X-ray is emitted in the process (Fig. 6- 14c). If the electron transition 
occurs between L and K shells, Ka X-rays are produced. Different X-rays are 
generated, e.g., X-rays from M-► K, and La X-rays from M -► L 
transitions. There are two facts worth remembering about these X-rays. 

1. The difference in energy between the levels involved in the electron 
transition is what determines the energy (or wavelength) of the emitted X-ray. 
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X-RAY EMISSION AUGER ELECTRON 

EMISSION 

Figure 6-14. Schematic of electron energy transitions: (a) initial state; (b) incident 
photon (or electron) ejects K shell electron; (c) X-ray emission when 2s electron Fills 
vacancy; (d) Auger electron emission. KLL transition shown. 

For example, 

= (6-15) 

A Ka, 

where h , c, and X have their usual meaning. 

2. The emitted X-rays are characteristic of the particular atom undergoing 
emission. Thus, each atom in the Periodic Table exhibits a unique set of K, L, 
M, etc., X-ray spectral lines that serve to unambiguously identify it. These 
characteristic X-rays are also known as fluorescent X-rays when excited by 
incident photons (e.g.. X-rays and gamma rays). 

There is, however, an alternative process by which the electron hole in Fig. 
6-14b can be filled. This involves a complex transition in which three, rather 
than two, electron levels, as in EDX, participate. The Auger process, which is 
the basis of AES, first involves an electron transition from an outer level (e.g., 
Lj) to the K hole. The resulting excess energy is not channeled into the 
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creation of a photon but is expended in ejecting an electron from yet a third 
level (e.g., L 2 ). As shown in Fig. 6-14d, the atom finally contains two 
electron holes after starting with a single hole. The electron that leaves the 
atom is known as an Auger electron, and it possesses an energy given by 

^KLL “ ~ El 2 = ~ *L 2 — ^L, ■ (6-16) 

The last equality indicates KL,L 2 and KL 2 L, transitions are indistinguishable. 
Similarly, other common transitions observed are denoted by LMM and MNN. 
Since the K, L, and M energy levels in a given atom are unique, the Auger 
spectral lines are characteristic of the element in question. By measuring the 
energies of the Auger electrons emitted by a material, we can identify its 
chemical makeup. 

To quantitatively illustrate these ideas, let us consider the X-ray and Auger 
excitation processes in titanium. The binding energies of each of the core 
electrons are indicated in Fig. 6-15, where electrons orbiting close to the 
nucleus are strongly bound with large binding energies. Electrons at the Fermi 
level are far from the pull of the nucleus and therefore taken to have zero 
binding energy, thus establishing a reference level. They would still have to 
acquire the work function energy to be totally free of the solid. Some notion of 
the rough magnitude of the core energy levels can be had from the well-known 
formula for hydrogen-like levels; i.e., 

£= 13.6Z 2 /n 2 (eV), (6-17) 

where Z is the atomic number and n is the principal quantum number. For Ti 
(Z = 22), the calculated energy of the K shell (n = 1) is 6582 eV. Complex 
electron-electron interactions and shielding of the nucleus makes this formula 
far too simplistic for multielectron atoms. Both effects reduce electron binding 
energies relative to Eq. 6-17. Several of the prominent characteristic X-ray 
energies and wavelengths for Ti are Ka { : E K - E L 3 = 4966.4 - 455.5 = 
4511 eV, X = 2.75 A; K 0] : E K - E My = 4966.4 - 34.6 = 4932 eV, X = 
2.51 A; La: E - E Mas = 455.5 - 3.7 = 452 eV, X = 27.4 A. Similarly, 
a prominent Ti Auger spectral transition is LMM or 

- *m 3 " *m 4 = 455.5 - 34.6 - 3.7 = 417 eV. 

The question may arise: When do atoms with electron holes undergo X-ray 
transitions, and when do they execute Auger processes? The answer is that 
both processes go on simultaneously. In the low-Z elements, the probability is 
greater that an Auger transition will occur, whereas X-ray emission is favored 
for high Z elements. The fractional proportions of the characteristic X-ray 
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X-RAY ENERGY, keV BINDING ENERGY eV 

Figure 6-15. Electron excitation processes in Ti: (a) energy-level scheme; (b) EDX 
spectrum of Ti employing Si(Li) detector; (c) AES spectral lines for Ti (dN(E)/dE 
vs. E); (d) a portion of the XPS spectrum for Ti (MgKa radiation). 
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Figure 6-16. Characteristic X-ray emission energies of the elements. (Courtesy of 
Princeton Gamma Tech, Inc.) 



Figure 6-17. Principal Auger electron energies of elements. (Courtesy of Physical 
Electronics Industries, Inc.) 

and Auger yields for K, L, M transitions can be found in standard references 
(Ref. 13). 

The variation of the principal characteristic X-ray and Auger lines with 
atomic number is shown in Figs. 6-16 and 6-17, respectively. Commercial 
spectrometers typically operate within the energy range spanned in these 
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figures. Therefore, in EDX, K X-ray transitions are conveniently measured in 
low-Z materials, and L series X-rays appear when high-Z elements are 
involved. Similarly, in AES, KLL and LMM transitions are involved for 
low-Z elements, and LMM and MNN lines appear for high-Z elements. 
Although keeping track of the particular shell involved is sometimes annoying, 
spectra from virtually all of the elements in the periodic table can be detected 
with a single excitation source and a single detector. Fortunately, the resolu¬ 
tion of X-ray or electron detectors is such that prominent lines of neighboring 
elements do not seriously overlap. This facilitates spectral interpretation and 
atomic fingerprinting. 

The basis for understanding XPS lies in the same atomic core electron 
scheme that we have been considering. Rather than incident electrons in the 
case of EDX and AES, relatively low-energy X-rays impinge on the specimen 
in this technique. The absorption of the photon results in the ejection of 
electrons via the photoelectric effect. Governing this process is the well-known 
equation expressed by 

^ke ~ hv — E b , (6-18) 

where ^KE’ hv> and E b are the energies of the ejected electron, incident 
photon, and the involved bound electron state. Since values of the binding 
energy are element-specific, atomic identification is possible through measure¬ 
ment of photoelectron energies. 

6.4.3. X-Ray Energy-Dispersive Analysis (EDX) 

6.4.3.7. Equipment Most energy-dispersive X-ray analysis systems are 
interfaced to SEMs, where the electron beam serves to excite characteristic 
X-rays from the area of the specimen being probed. Attached to the SEM 
column is the liquid-nitrogen Dewar with its cooled Si(Li) detector aimed to 
efficiently intercept emitted X-rays. The Si(Li) detector is a reverse-biased Si 
diode doped with Li to create a wide depletion region. An incoming X-ray 
generates a photoelectron that eventually dissipates its energy by creating 
electron-hole pairs. The incident photon energy is linearly proportional to the 
number of pairs produced or equivalently proportional to the amplitude of the 
voltage pulse they generate when separated. 

The pulses are amplified and then sorted according to voltage amplitude by a 
multichannel analyzer, which also counts and stores the number of pulses 
within given increments of the voltage (energy) range. The result is the 
characteristic X-ray spectrum shown for Ti in Fig. 6-15. Si(Li) detectors 
typically have a resolution of about 150 eV, so overlap of peaks occurs when 
they are not separated in energy by more than this amount. Overlap sometimes 
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occurs in multicomponent samples or when neighboring elements in the 
periodic table are present. 

Several variants of X-ray spectroscopy are worth mentioning. In X-ray 
wavelength-dispersive analysis (WDX), where wavelength rather than energy 
is dispersed, a factor of 20 or so improvement in X-ray linewidth resolution is 
possible. In this case, emitted X-rays, rather than entering a Si(Li) detector, 
are diffracted from single crystals with known interplanar spacings. From 
Bragg’s law, each characteristic wavelength reflects constructively at different 
corresponding angles, which can be measured with very high precision. As the 
goniometer-detector assembly rotates, the peak is swept through as a function 
of angle. The electron microprobe (EMP) is an instrument specially designed 
to perform WDX analysis. This capability is also available on an SEM by 
attaching a diffractometer to the column. The high spectral resolution of WDX 
is offset by its relatively slow speed. 

Characteristic X-rays can also be generated by using photons and energetic 
particles rather than electrons as the excitation source. For example, conven¬ 
tional X-ray tubes, and radioactive sources such as 24I Am (60-keV gamma ray, 
26.4-keV X-ray) and 109 Cd (22.1-keV Ag-K X-ray) can excite fluorescent 
X-rays from both thin-film and thick specimens. Unlike electron-beam sources, 
they have virtually no lateral spatial resolution. 

6.4.3.2. Quantification. Quantitative analysis of an element in a multicom¬ 
ponent matrix is a complicated matter. The expected X-ray yield, Y x (d ), 
originating from some depth d below the surface depends on a number of 
factors: I Q {d), the intensity of the electron beam at d\ C, the atomic concen¬ 
tration; a, the ionization cross section; w x , the X-ray yield; the X-ray 
absorption coefficient; and e, 12, and 0, the detector efficiency, solid angle, 
and angle with respect :o the beam, respectively. Therefore, 

Y x {d) ~ / 0 (tf)Cw x e-^ /cosfl frrfO/4ir, (6-19) 

and the total signal detected is the sum contributed by all atomic species 
present integrated over the depth range. It is sometimes simpler to calibrate the 
yields against known composition standards. Excellent computer programs, 
both standardless and employing standards, are available for analysis, and 
compositions are typically computed to approximately 0.1 at %. 

6.4.4. Auger Electron Spectroscopy (AES) 

6.4.4.I. Equipment The typical AES spectrometer, shown schematically 
in Fig. 6-18, is housed within an ultrahigh vacuum chamber maintained at 




Figure 6-18. Schematic of spectrometer with combined AES and XPS capabilities. (Courtesy of Physical 
Electronics Industries, Inc.) 
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— 10” 10 torr. This level of cleanliness is required to prevent surface coverage 
by contaminants (e.g., C, O) in the system. The electron-gun source aims a 
finely focused beam of — 2-keV electrons at the specimen surface, where it is 
scanned over the region of interest. Emitted Auger electrons are then energy - 
analyzed by a cylindrical (or hemispherical in some systems) analyzer. The 
latter consists of coaxial metal cylinders (or hemispheres) raised to different 
potentials. The electron pass energy E is proportional to the voltage on the 
outer cylinder, and the incremental energy range AE of transmitted electrons 
determines the resolution (A E/E), which is typically 0.2 to 0.5%. Electrons 
with higher or lower energies (velocities) than E either hit the outer or inner 
cylinders, respectively. They do not exit the analyzer and are not counted. By 
a sweep of the bias potential on the analyzer, the entire electron spectrum is 
obtained. Complete AES spectrometers are commercially available and cost 
about a half-million dollars. Auger electrons are but a part of the total electron 
yield, N(E), intermediate between low-energy secondary and high-energy 
elastically scattered electrons. They are barely discernible as small bumps 
above the background signal. Therefore, to accentuate the energy and magni¬ 
tude of the Auger peaks, the spectrum is electronically or numerically differen¬ 
tiated, and this gives rise to the common AES spectrum, or dN(E)/dE vs. E 
response, shown in Fig. 6-15c for Ti. The reader should verify that differentia¬ 
tion of a Gaussian-like peak yields the wiggly narrow double-peak response. 
By convention, the Auger line energy is taken at the resulting peak minimum. 

Two very useful capabilities for thin-film analysis are depth profiling and 
lateral scanning. The first is accomplished with incorporated ion guns that 
enable the specimen surface to be continuously sputtered away while Auger 
electrons are being detected. Multielement composition depth profiles can thus 
be determined over total film thicknesses of several thousand angstroms by 
sequentially sampling and analyzing arbitrarily thin layers. Although depth 
resolution is extremely high, the frequently unknown sputter rates makes 
precise depth determinations problematical. Through raster or line scanning the 
electron beam, the AES is converted into an SEM and images of the surface 
topography are obtained. By modulating the imaging beam with the Auger 
electron signal, we can achieve lateral composition mapping of the surface 
distribution of particular elements. Unlike EDX-SEM composition mapping, 
only the upper few atom layers is probed in this case. 


6.4.4.2. Quantification . The determination of the Auger electron yield 
from which atomic concentrations can be extracted is expressed by a formula 
similar in form to Eq. 6-19 for the X-ray yield. The use of external standards 
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is very important in quantifying elemental analysis, particularly because stan¬ 
dardless computer programs for AES are rather imprecise when compared with 
those available for EDX analysis. An approximate formula that has been 
widely used to determine the atomic concentration of a given species A in a 
matrix of m elements is 



£ his, 


i= 1 


( 6 - 20 ) 


The quantity /, represents the intensity of the Auger line and is taken as the 
peak-to-peak span of the spectral line. The relative Auger sensitivity S, also 
enters Eq. 6-20. It has values ranging from ~ 0.02 to 1 and depends on the 
element in question, the particular transition selected, and the electron-beam 
voltage. Uncertainties in C values so determined are perhaps a few atomic 
percent at best. 


6.4.5. X-Ray Photoelectron Spectroscopy (XPS) 

In order to capitalize on the X-ray-induced photoelectron effect, a spectrome¬ 
ter like the one used for AES and shown in Fig. 6-18 is employed. The only 
difference is the excitation source, which is now a beam of either Mg or A1 
Ka X-rays. These characteristic X-rays have relatively low energy (e.g., 
/i^ Mg = 1254 eV and hv M - 1487 eV) and set an upper bound to the kinetic 
energy of the detected photoelectrons. 



Figure 6-19. AES depth profiles of A1 and Ga through GaAs and Al v Ga,_ v As 
films. Signal for As not shown. (Courtesy of R. Kopf, AT&T Bell Laboratories.) 
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A portion of the XPS spectrum for Ti is shown in Fig. 6-15d, where 2s, 
2 Pi/ 2> and 2p 3/2 peaks are evident. Interestingly, characteristic Auger electron 
transitions (not shown) frequently appear at precisely the energy locations 
indicated in Fig. 6-15c. The XPS peak positions, however, are shifted slightly 
by a few electron volts from those predicted by Eq. 6-18 because of work 
function differences between the specimen and detector. 

It is beyond our scope to discuss spectral notation, the chemistry and physics 
of transitions, and position and width of the lines. What is significant is that 
linewidths are considerably narrower than those associated with Auger transi¬ 
tions. This fact makes it possible to gain useful chemical bonding information 
that can also be attained with less resolution by AES, but not by the other 
surface analytical techniques. It is for this reason that XPS is also known as 
electron spectroscopy for chemical analysis (ESCA). 

Effects due to chemical bonding originate at the valence electrons and ripple 
beyond them to alter the energies of the core levels in inverse proportion to 
their proximity to the nucleus. As a result, energy shifts of a few electron volts 
occur and are resolvable. For example, in the case of pure Ti, the 2p 3//2 line 
has a binding energy of 454 eV. In compounds this electron is more tightly 
bound to the Ti nucleus; apparently the electron charge clouds of the neighbor¬ 
ing atoms “repel” it. In TiC, TiN, and TiO, the same line is located at 
E b = 455 eV. Similarly, for the compounds Ti0 2 , BaTi0 3 , PbTi0 3 , SrTi0 3 , 
CaTi0 3 , and (C 2 H 5 ) 2 TiCl 2 the transition occurs between E g = 458 and 459 
eV. Clearly the magnitude of the chemical shift alone is not a sufficient 
condition to establish the nature of the compound. 

Aside from chemical bonding information, XPS has an important advantage 
relative to AES. X-rays are less prone to damage surfaces than are electrons. 
For example, electron beams can reduce hydrocarbon contaminants to carbon, 
destroying the sought-after evidence. For this reason, XPS tends to be pre¬ 
ferred in assessing the cleanliness of semiconductor films during MBE growth. 

6.4.6. A Couple of Applications in GaAs Films 

6.4.6.I. AES. As an example of AES, consider the depth profiles for Ga and 
A1 shown in Fig. 6-19. The structure represents a single-crystal GaAs substrate 
onto which a 2000-A thick, compositionally graded film of Al^Ga^^As was 
grown by molecular-beam epitaxy methods (Chapter 7). At first, the deposition 
rate of A1 was increased linearly while that of Ga correspondingly decreased 
until AlAs formed; then the deposition rates were reversed until the GaAs 
composition was attained after 2000 A. Finally, a 1000-A-cap film of GaAs 

o 

was deposited resulting in a 2000-A-wide, V-shaped quantum well sandwiched 
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between GaAs layers. Because the lattice constants of GaAs, AJAs, and the 
intermediate Al^Ga^^As compositions differ by only 0.16% at most, the 
entire “lattice-matched'’ structure is free of crystallographic defects. During 
AES depth profiling, the film is sputtered away, exposing surface compositions 
in reverse order to those that were initially deposited. The linearly graded 
walls of the quantum well reflect the precise control of deposition that is 
possible. In this case, the values of 5 a, and ^Ga are apparently independent of 
composition. 


6.4.6.2. XPS. Here we consider an AlGaAs film that has been etched in a 
CF 2 C1 2 + 0 2 plasma in order to fabricate devices. It is desired to determine 
the composition of the film surface relative to that of the underlying material. 
But both electron impingement and ion bombardment during sputter depth 
profiling would alter or even remove surface compounds. What is needed is a 
technique to probe surface layers nondestructively. Angle-resolved XPS is 
such a method. It is based on altering the takeoff angle for electron detection. 
If the angle (0) that exiting photoelectrons make with the surface plane is 
large, then chemical information on deep surface layers is sampled. However, 
if electrons exiting at a small grazing angle are detected, then only the top 
surface layers are probed; photoelectrons generated within deeper layers 
simply never emerge because their effective range ( l e ) is smaller than the now 
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Figure 6-20. Angle-resolved XPS spectra of Ga 3d line as a function of electron 
detection angle. (Courtesy of M. Vasile, AT&T Bell Laboratories.) 
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longer geometric escape-path length. In fact, the signal intensity from atoms at 
depth jc varies as / = 7 0 exp - x/l e c os 0. 

In the analysis shown in Fig. 6-20, 0 was varied from 90° to 20° while the 
Ga 3d peak was scanned. At the surface, bonding associated with GaF 3 and 
Ga 2 0 3 compounds was detected. Deeper within the film only Ga bound within 
GaAs or AlGaAs is evident. 

6.4.7. Rutherford Backscattering (RBS) (Refs. 17, 20) 

G.4.7.1. Physical Principles . This popular thin-film characterization tech¬ 
nique relies on the use of very high energy (MeV) beams of low mass ions. 
These have the property of penetrating thousands of angstroms or even 
microns deep into films or film-substrate combinations. Interestingly, such 
beams cause negligible sputtering of surface atoms. Rather, the projectile ions 
lose their energy through electronic excitation and ionization of target atoms. 
(For further discussion see Section 13.4.2.) These “electronic collisions” are 
so numerous that the energy loss can be considered to be continuous with 
depth. Sometimes the fast-moving light ions (usually 4 He + ) penetrate the 
atomic electron cloud shield and undergo close-impact collisions with the 
nuclei of the much heavier stationary target atoms. The resulting scattering 
from the Coulomb repulsion between ion and nucleus has been long known in 
nuclear physics as Rutherford scattering. The primary reason that this phe¬ 
nomenon has been so successfully capitalized upon for film analysis is that 
classical two-body elastic scattering is operative. This, perhaps, makes RBS 
the easiest of the analytical techniques to understand. 

We start by first considering an incident ion of mass (atomic weight) M 0 
and energy E 0 incident on a surface as shown in Fig. 6-21. An elastic collision 
between the ion projectile and a surface atom of mass M occurs such that 
afterward the ion energy is E x . The collision is insensitive to the electronic 
configuration or chemical bonding of target atoms, but depends solely on the 
masses and energies involved. As a consequence of conserving energy and 
momentum, it is readily shown that 


where 0 is the scattering angle. For a particular combination of Af 0 , Af, and 
0, the simple formula 

E\ = K m E, 


(A/ 2 - A/gSin 2 0)^ 2 -I- Af 0 cos 0 


Afo + M 


( 6 - 21 ) 


( 6 - 22 ) 
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x = 900 A x = 0 


Figure 6-21. (a) Geometry of scattering and notation of energies at the front and 

back surfaces of a 900-A-thick PtSi film, (b) 4 He + ion energy as a function of film 
depth as a result of scattering from Pt and Si. Schematic RBS spectrum shown rotated 
by 90°. (Reprinted with permission from Elsevier Sequoia S.A., from W. K, Chu, 
J. W. Mayer, M. A. Nicolet, T. M. Buck, G. Amsel, and F. Eisen, Thin Solid Films 
17, 1, 1963). 

relates the energy of emergent ions to that of the incident ions. The term K M 
is known as the kinematic factor and can be calculated from Eq. 6-21. Once 
the incident ion, e.g., 4 He + (A/ 0 = 4) at E 0 = 2 MeV, and angular position of 
the ion detector are selected (0 is typically 170°), K M just depends on the 
atomic weight of the target atom. For example, under the conditions just given, 
/fpt = 0.922 and AT Si = 0.565. This means that if a 2-MeV He ion collides 
with a Pt atom (M = 195) located on the outer surface of a PtSi film, it will 
backscatter into the detector with an energy of 1.844 MeV; similarly, He ions 
scattering from surface Si atoms will have their energy reduced to 1.130 MeV. 
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Consider now what happens when a 2-MeV 4 He + ion beam impinges on a 

o 

900-A platinum-silicide film on a Si wafer substrate. In Fig. 6-2la the 
film-substrate geometry is shown together with a schematic of the energy 
changes the ions undergo during nuclear scattering and the corresponding RBS 
spectrum (Fig. 6-21b). Energy changes (Zs 0 E x ) for scattering from Pt and 
Si surface atoms are shown on the x = 0 axis. The majority of the He ions 
penetrate below the film surface, however, where they continuously lose 
energy at a linear rate (E 0 -> E 2 ) with distance traversed. At any film depth 
they can suffer an atomic collision. The scattered ion energy is still given by 
Eq. 6-22, but E 0 is now the incident 4 He + energy at that point in the matrix. 
Some of the energy-attenuated 4 He + ions can reach the PtSi-Si interface 
where they may backscatter (E$ — K M E 2 ) and again lose energy (£3 -> E 4 ) 
in traversing the film backward until they finally exit. Other 4 He + ions can 
even penetrate into the Si substrate where they eventually backscatter. It is 
important to realize that in the course of passage through the film, the ion 
beam can be thought of as splitting into two elemental components, each 
spanning a different range of energies. For each broad elemental peak de¬ 
tected, the highest and lowest energies correspond to atoms on the front and 
back film surfaces, respectively. Statistics largely govern the depth at which 
scattering occurs and whether Pt or Si atoms participate in the collisions. 

Thus, after measurement of the number and energy of backscattered He 
ions, information on the nature of the elements present, their concentration, 
and depth distribution can all be simultaneously determined without apprecia¬ 
bly damaging the specimen. 


6.4.7.2. Equipment A schematic of the experimental arrangement em¬ 
ployed for RBS is shown in Fig. 6-22. Lest the reader be deceived by the 
figure, it should be appreciated that the actual facility shown in Fig. 6-23 is 
some 15 m long and occupies more than 100 m 2 of floor space. Ions for 
analysis (e.g., 4 He, 12 C, 4 N, etc.) are accelerated by the high voltage 

generated by the Van De Graaff accelerator. After entering the evacuated 
(~ 10^ 6 —10 7 torr) beam line, the ions are then collimated and focused. Mass 
selection occurs in the bending magnet, which geometrically disperses ions 
according to their mass. The resultant ion beam is then raster-scanned across 
the surface of the specimen. Backscattered ions are analyzed with respect to 
their energy by a silicon surface barrier detector capable of an energy 
resolution of —15 keV (peak width at half-maximum amplitude). The elec¬ 
tronic pulses are then amplified and sorted according to voltage amplitude 
(i.e., energy) by a multichannel analyzer to yield the resulting RBS spectrum. 
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Figure 6-22. Schematic of the 1.7-MeV tandem accelerater, RBS facility at AT&T Bell 
Laboratories, Murray Hill, NJ. 
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Figure 6-23. Photograph of 1.7-MeV tandem accelerater-RBS facility. (Courtesy of 
D. C. Jacobson, AT&T Bell Laboratories.) 


6.4.7.3. Capabilities and Limitations 

Elemental Information. All elements and their isotopes including Li and 
those above it in the periodic table are, in principle, detectable with 4 He + ions. 
The critical test is how well neighboring elements are resolved, and this 
ultimately depends on the detector resolution. With 2-MeV 4 He + , isotopes 
with AM = 1 can generally be separated for M below approximately 40. At 
values of M « 200 only atoms for which AM > 20 can be resolved. Thus, 
Bi and Os would be indistinguishable. The apparent advantage in separat¬ 
ing low-Z elements is offset by their low cross sections (o,) for scattering. The 
o i are a measure of how efficiently target atoms scatter incoming ions and 
depend on Z f -, particle energy E, the masses involved, and the angle of 
scattering. To a good approximation, their dependence on these quantities 
varies as 


a = 


Z\ ^2 

4 E 




(6-23) 


where the term in brackets is an important correction for low-mass targets. 
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Clearly, high-Z elements produce a stronger backscattered signal than low-Z 
elements. 

Consideration of these factors suggests that specimens for RBS analysis 
should ideally contain elements of widely different atomic weight stacked with 
the heavy atoms near the surface and the lighter atoms below them. In bilayer 
film structures it is desirable for the high-A/ film to be at the outer surface. 
Otherwise there is the danger that separate elemental peaks will overlap. 

In general, when applicable, RBS can detect concentration levels of about 1 
at%. The technique is unmatched in determining the stoichiometry of thin-film 
binary compounds such as metal silicides where accuracies of - ± 1 % or so 
are achieved. 

6.4.7.4. Spatial and Depth Resolution . Since MeV ion beams can only be 
focused to a spot size of a millimeter or so in diameter, the lateral spatial 
resolution of RBS is not great. The depth resolution is commonly quoted to be 
200 A. This can be improved to 20 A by altering the geometry of detection. 
Grazing exit angles are employed to make the film appear to be effectively 
thicker. For example, when 0 = 95°, corresponding to an exit angle of 5°, ion 
scattering at a given film depth means that the energy loss path length is an 
order of magnitude longer. Implicit in the use of RBS is the desirability that the 
specimen surface and underlying layered structures be precisely planar. Fortu¬ 
nately, polished Si wafers are extraordinarily flat. Films grown or deposited on 
Si maintain this planarity and are thus excellently suited for RBS analysis. 
Films with rough surfaces yield broadened RBS peaks. 

The maximum film depth that can be probed depends on the ion used, its 
energy, and the nature of the matrix. Typically, - 1 pm is an upper limit for 
2-MeV 4 He + . On the other hand, 3 H + beams of 2 MeV penetrate - 5 pm 
deep in Si. 

6.4.7.5. Quantification . Despite the limitations noted, RBS enjoys the sta¬ 
tus of being the preferred method of analysis in situations where it is 
applicable. The basic reasons are that it is quantitative from first principles, 
does not require elemental standards, and yields simultaneous depth and film 
thickness information. Clearly, the area under a spectral peak represents the 
total number of atoms of a given element present within some continuous 
region or layer. The peak height ( H ) is directly proportional to the atomic 
concentration. The peak width (AZT) depends on the maximum length tra¬ 
versed by projectile ions in the layer. Therefore, AE is directly proportional to 
the layer or film thickness if the ion energy attenuation with distance is known. 
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Figure 6-24. Energy spectrum for 2-MeV He + ions backscattered from 900 A of 
PtSi on a Si substrate. (Reprinted with permission from Elsevier Sequoia S.A., from 
W. K. Chu, J. W. Mayer, M. A. Nicolet, T. M. Buck, G. Amsel, and F. Eisen, Thin 
Solid Films 17, 1 , 1963). 


Particle range equations derived for nuclear physics applications yield this 
information to a high degree of accuracy. Hence, RBS is a useful way to 
determine Film thicknesses. 

As an exercise in chemical analysis, let us consider the experimental PtSi 
spectrum (Fig. 6-24) and calculate the relative proportions of Ft and Si present. 
By analogy to Eq. 6-20, the concentration ratio is given by 


Cpt _ ^ pi / a pt 
^Si ^ Si / a Si 


(6-24) 


where A i are the peak areas and the scattering cross sections are, in effect, 
sensitivity factors. For PtSi, A^/A^ is measured to be 32, and a Pt /a Si is 
estimated to be equal io /Z Sl ) 2 - 31.8, so that C Pt /C Sl = 1.01. Alterna¬ 
tively, since A i = //, AE,, we have 

^Pt _ °Si ^Pt A E Px 

Qsi J Pt "s, A *si 


(6-25) 
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Experimental measurement reveals that H^/H^ = 27.5 and A£’ Pt /A£' Si = 
1.15, and, therefore, /C Si is calculated to be 0.99. 

Although film thickness can be determined to within 5%, the procedure is 
somewhat involved. The required ion energy loss with distance data, expressed 
as stopping cross sections, are tabulated and can be calculated for any matrix 
(Ref. 13). Film thicknesses can, in principle, also be determined either from 
known total incident charge and detector solid angle, or through the use of 
thickness standards. Experimental difficulties, however, limit the utility of 
these latter methods in practice. 


6.4.7.6. Channeling . An interesting effect known as channeling can greatly 
extend the depth of ion penetration into specially oriented single-crystal film 
matrices. To understand why, imagine viewing a ball-and-spoke model of a 
diamond crystal structure along various crystal directions. In many orienta¬ 
tions, the model appears impenetrable to impinging ions. But along the [110] 
direction, a surprisingly large hexagonal tunnel is exposed through which ions 
can deeply penetrate by undergoing glancing, zig-zag collisions with the tunnel 
wall atoms. The ion trajectories simply do not bring them close enough to 
target atoms where they can undergo the nuclear collisions that are particularly 
effective in slowing them down. Rather, these channeled ions lose energy 
primarily by electronic excitation of the lattice and therefore range further than 
if the matrix were, say, amorphous. 

Channeling effects can be used advantageously to distinguish RBS spectral 
features when crystalline, polycrystalline, and amorphous film layers coexist. 
If, for example, an amorphous Si (a-Si) film covers underlying crystalline Si 
(c-Si) the 4 He + yield from Si would abruptly drop to the c-Si value at the a-c 
interface (see Fig. 13-9). However, during ion implantation doping of semi¬ 
conductors, channeling effects are unwelcome because the resulting dopant 
profiles are modified in ways not easily predicted. 


6.4.8. Secondary Ion Mass Spectrometry (SIMS) (Ref. 18, 21) 

The mass spectrometer, long common in the chemistry laboratory for the 
analysis of gases has been dramatically transformed in recent years to create 
SIMS apparatus capable of analyzing the chemical composition of solid surface 
layers. A critical need to measure thermally diffused and ion-implanted depth 
profiles of dopants in semiconductor devices spurred the development of 
SIMS. In typical devices, peak dopant levels are about 10 20 /cm 3 while 
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background levels are 10 l5 /cm 3 . These correspond to atomic concentrations in 
Si of 0.2% to 2 x 10 _6 %, respectively. None of the analytical techniques 
considered thus far has the capability of detecting such low concentration 
levels. The price paid for this high sensitivity is an extremely complex 
spectrum of peaks corresponding the the masses of detected ions and ion 
fragments. This necessitates the use of standards, composed of the specific 
elements and matrices in question, for quantitative determinations of composi¬ 
tion. 

In SIMS, a source of ions bombards the surface and sputters neutral atoms, 
for the most part, but also positive and negative ions from the outermost film 
layers. Once in the gas phase, the ions are mass-analyzed in order to identify 
the species present as well as determine their abundance. Since it is the 
secondary ion emission current that is detected in SIMS, high-sensitivity 
analysis requires methods for enhancing sputtered-ion yields. Secondary ion 
emission may be viewed as a special case of (neutral atom) sputtering. 
However, a comprehensive theory to quantitatively explain all aspects of 
secondary ion emission (e.g., ion yields S + and S', escape velocities and 
angles, dependence on ion projectile and target material, etc.) does not yet 
exist. Reliable experiments to test proposed theories are difficult to perform. 
Experimentally, it has been found that different ion beams interact with the 
specimen surface in profoundly different ways. For example, the positive 
metal ion yield of an oxidized surface is typically enhanced 10-fold and 
frequently more relative to a clean surface. This accounts for the common 
practice of using O 2 beams to flood the surface when analyzing positive ions. 
Similarly, the negative ion signals can be enhanced by using Cs + primary ion 
beams. 

One of the theories that attempts to explain the opposing effects of O 2 and 
Cs + beams involves charge transfer by electron tunneling between the target 
and ions leaving the target surface. Negative ion (O 2 ) bombardment repels 
charge from the surface, in effect lowering its Fermi energy and raising its 
effective work function (<t>). Tunneling is now favored from the surface atom 
(soon to be ejected positive ion) into the now empty electron states of the 
target. Similarly, positive ion (Cs + ) bombardment lowers the target work 
function. Now electrons tunnel from the target into empty levels of surface 
atoms, enhancing the creation of negative ions. Since these charge transfer 
processes depend exponentially on 0, very large changes in ion yields with 
small shifts in <j> are possible. 

A schematic depicting the basic elements of a double-focusing SIMS spec¬ 
trometer is shown in Fig. 6-25. The primary ions most frequently employed 
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Figure 6-25. Schematic of the ion optical system in the Cameca double-focusing 
mass spectrometer: 1. Cs ion source; 2. duoplasmatron source; 3. primary beam mass 
filter; 4. immersion lens; 5. specimen; 6. dynamic transfer system; 7. transfer optical 
system; 8. entrance slit; 9. electrostatic sector; 10. energy slit; 11. spectrometer lens; 
12. spectrometer; 13. electromagnet; 14. exit slit; 15. projection lens; 16. projection 
display and detection system; 17. deflector; 18. channel plate; 19. fluorescent screen; 
20. deflector; 21. Faraday cup; 22. electron multiplier. (Courtesy Cameca Instruments, 
Inc., Stamford, Connecticut). 


are Ar + , , and Cs + , and these are focused into a beam ranging from 2 to 
15 keV in energy. The sputtered charged atoms and compound fragments are 
extracted and enter an electrostatic energy analyzer similar to the cylindrical 
and spherical electron energy analyzers employed in AES and XPS work. 
Those secondary ions that pass now enter a magnetic sector mass filter whose 
function is to select a particular mass for detection. The desired ion of mass 
Af, charge q, and velocity v traces an arc of radius r in the magnetic field (B) 
of the electromagnet, given by 


r = Mv/qB. 


( 6 - 26 ) 
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Table 6-4. Comparison between Static and Dynamic SIMS 


Variable 

Static SIMS 

Dynamic SIMS 

Residual gas pressure (torr) 

io-Mo 10 

io- 7 

Primary ion energy (keV) 

0.5-3 

3-20 

Current density (A/cm 2 ) 

o 

i 

1 

© 

O' 

10“ 5 -10“ 2 

Area of analysis (cm 2 ) 

0.1 

10~ 4 

Sputter rate (sec/monolaycr) 

10 4 

1 


Two modes of operation are possible. In the imaging mode, B is fixed, and 
simultaneous projection of ions of different mass on the channel plate yields 
mass spectra maps in the form of an ion micrograph (spectrograph). In the 
mass spectro/we/ry mode, B is scanned, and each ion mass is sequentially 
detected by the electron multiplier detector. Magnetic sector mass filters 
possess high-resolution capability (M/AM ^ 3000-20,000). Another com¬ 
mon version of SIMS employs a quadrupole lens of lower resolution; i.e., 
M/AM = 500. 

A further distinction is made between what is known as ‘‘static” and 
“dynamic” SIMS. The issue that distinguishes them is the rate of specimen 
erosion relative to the time necessary to acquire data. Static SIMS requires that 
data be collected before the surface is appreciably modified by ion bombard¬ 
ment. It is well suited to surface analysis and the detection of contaminants 
such as hydrocarbons. Dynamic SIMS, on the other hand, implies that high 
sputtering rates are operative during measurement. This, of course, enables 
depth profiling of surface layers. Typical operating parameters for both static 
and dynamic SIMS are given in Table 6-4. 

One of the unique features of SIMS is its mass discrimination, which often 
provides interesting processing information. For example, consider a bipolar 
transistor that contains both Sb and Sb in the emitter but only Sb in the 
collector. The emitter doped with both naturally occurring isotopes of anti¬ 
mony must have been diffused while the collector was obviously ion-im¬ 
planted. Extreme discrimination is required when two species of very similar 
mass must be distinguished. For example, to separate the 31 P (mass = 30.974) 
signal from that of the interfering 30 SiH (mass = 30.982) background level, 
the resolution required is M/AM = 3872. With high-resolution mass spec¬ 
trometers, these species have been separated and phosphorous doping profiles 
obtained as shown in Fig. 6-26. 
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DEPTH (Angstroms) 

Figure 6-26. SIMS depth profile of P implanted in Si obtained with Cs + ions. 
(Courtesy of H. Luftman, AT&T Bell Laboratories.) 


6.4.9. Applications 

The chapter closes with the self-explanatory Table 6-5, in which some applica¬ 
tions of the surface analytical techniques just considered are listed. The varied 
phenomena, materials, and structures involved extend into all facets of thin-film 
science and technology. Ever-expanding applications will ensure future growth 
of the list. 



Exercises 



1. Assume you are given samples of the following items: 

a. a glass camera lens coated with a purple-colored film. 

b. a steel drill coated with a gold-colored metallic layer. 

c. a plastic potato chip bag coated with a thin film. 

In each case how would you experimentally determine the film or coating 
composition and thickness? 

2 . Material deposits onto a quartz crystal thickness monitor from two 
separate evaporation sources, A and B. Explain possible difference in the 
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Table 6-5. Applications of Surface Analytical Techniques in Thin Films 



Application 

Information Obtained, Comments 

Technique 

1 . 

Nucleation and growth 

Distinctions among island, layer 
and S.K. growth modes (p. 220) 

AES 

2. 

Diffusion in metal 

Diffusion coefficients are 

AES, SIMS, 


films 

obtained by sputter sectioning, 
and surface accumulation methods 

RBS 

3. 

Doped semiconductors 

Diffused and ion-implanted depth 
profiles in Si and GaAs 

SIMS 

4. 

Compound formation 

Stoichiometry of 
intermetallic and silicide 
compounds, growth kinetics 

RBS, AES 

5. 

Investigation of 

Identification of elemental 

AES, SIMS, 


surface residues, 

contaminants 

XPS 


stains, haze, and 

discoloration after 
processing 

Identification of compound 
valence and bonding states 

XPS 

6. 

Contamination of 
surfaces by organic 

Identification of elements 

AES, SIMS 

XPS 


materials 

Identification of compounds 

XPS 

7. 

Interfacial analysis 

Cause of adhesion failure, 
segregation of impurities at 
grain boundaries and interfaces 

AES, RBS, 

SIMS, XPS 

8. 

Multilayer films and 

Stoichiometry, layer thickness, 

AES, RBS, 


coatings, superlattices 

interfacial impurities 

SIMS 

9. 

Determination of 
crystalline perfection 

Channeling spectra distinguish 
between single-crystal and 
amorphous Si films 

RBS 

10. 

Fracture of coatings 

Segregation of impurities at 
fracture surfaces 

AES. SIMS 

11. 

Metal - Semiconductor 

Adhesion, contact reactions in 

AES, XPS, 


contacts 

Si and GaAs 

SIMS, RBS 

12. 

Dielectric films on 

Surface contamination, impurity 

AES, SIMS, 


metals and semiconductor 

diffusion, and segregation at 
interfaces (e.g., Si0 2 -Si) 

XPS 

13. 

Molecular-beam epitaxy 

Assessment of surface cleanliness 
prior to deposition, detection of 

C and O contaminants 

AES, XPS 
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frequency shift if 

a. a bilayer film deposits, i.e., first mass m A from source A, then mass 
m B from source B. 

b. an alloy film deposits, i.e., m A and m B deposit simultaneously. Is the 
final film thickness the same in each case? Make any assumptions you 
wish. 

3 . After monitoring the thickness of a deposited Au film with a 6.0-MHz 
quartz (AT cut) crystal monitor, a researcher decides to confirm his 
results employing interferometry. A frequency shift of 10.22 Hz was 
recorded for the film measuring 1.00 cm 2 in the area. Interferometry with 
the Hg green line revealed a displacement of 1.75 fringes across the film 
step. Are these measurements consistent? If not, suggest plausible reasons 
why not? 

[Note: Density of Au is 19.3 g/cm 3 .] 

4 . a. Contrast the optical interference effect when viewing an Si0 2 step 

gauge in white light and in monochromatic light, 
b. Similar step gauges have been prepared for native oxides on GaAs and 
GaP substrates. How would you account for color differences between 
these semiconductors coated with oxides of the same thickness? 

5 . Based on the diffraction pattern of Fig. 6-13, what are the lattice 

o 

parameters for Ag and Pb? [Note: Xc uKa = 1.54 A.] 

6- Recommend specific structural and chemical characterization techniques 
for analyzing the following samples. In each case indicate potential 
difficulties in the analysis. 

a. The surface of a glass slide. 

o 

b. A 1000-A-thick layer of crystalline Si0 2 buried in crystalline Si 3000 

o 

A below the surface. 

c. A thin film of a Bi-Th alloy. (Thorium is radioactive emitting 
high-energy a, 0 , and y radiation.) 

d. A thin carbon deposit on a Be substrate. 

e. A “black” A1 film surface consisting of a deeply creviced, rough 
moundlike topography. (Surface features are typically 1 /on in size.) 

f. A liquid Ga film surface at 35 °C. 

g. A1 2 0 3 powder — 0.8 /*m in size. 

7 . Explain how an SEM/EDX facility could be used to measure the thick¬ 
ness of film A without detaching it from substrate B (film thickness 



Exercises 


303 


standards may be necessary). Can the film thickness be measured this way 
if film and substrate have the same composition? 

8 - Auger electrons emanating from A atoms located a depth z below the 
surface give rise to a signal of intensity 

I a = k[ C(z)e z/Xe dz, 

J o 

where \ e is the electron escape depth, C(z) is the concentration of A 
atoms, and A” is a constant. 

a. What is the intensity from a pure A material? 

b. A pure A substrate is covered by a film of thickness d so that 
C A (z) = 0 for d > z, and C A (z) =1 for oo > z > d. Show that 
I a ~ 4v(0) ex P - d/\, where / A (0) is the signal intensity from a pure 
A surface. Does this suggest a way to determine dl 

9- The surface of a film contained the following elements with the indicated 
sensitivities, S , and Auger intensities, / (in arbitrary units) 


Element 

s 

/ 

Ga 

0.68 

6,950 

As 

0.68 

5,100 

A1 

0.23 

3,040 

O 

0.71 

26,900 

C 

0.30 

5,000 

F 

1.0 

40,500 

Cl 

0.89 

3,700 


What is the composition of the surface in atomic percent? Are the binding 
energies of Ga in the compounds GaF 3 , Ga 2 0 3 , and GaAs (Fig. 6-20) 
consistent with what you know about the chemistry of these materials? 

10. a. Sketch the RBS spectrum for a 900-A-thick Pt film on a Si wafer. How 

o 

does it differ from the spectrum of 900 A of PtSi on Si? In both cases 
2.0-MeV 4 He + ions are employed. 

b. Sketch the RBS spectrum for the case of a 900-A Si film on a thick Pt 
substrate. 

11. Consider the RBS spectra of Fig. 8-12. 

a. Calculate the value of a Au /cr A1 from the initial Au and A1 data. How 
does it compare with the value obtained from Eq. 6-23? Assume 
6 = 170°. 

b. If the initial order of film stacking were reversed, what would the 
resulting RBS spectrum look like initially and after annealing? 
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c. Are the stoichiometries of Au 2 A1 and AuA 1 2 consistent with the value 
for °au/ ct ai? 

12. Refer to the RBS spectra of Fig. 13-9. 

a. What is the significance of the yield counts of — 1000 and 3000 per 
channel in each Si spectrum? 

b. What is the width of the a-Si layer after 15 min at 515 °C? 

c. How does the total number of Au atoms partitioned in a-Si change as a 
function of annealing time? 

d. From the width and height of spectral features calculate the solubility 
of Au in a-Si after 85 min. 

13. The implanted P dopant distribution shown in the SIMS spectrum of Fig. 

6-26 can be described by the equation 


C(z) 


<t> I z-R p \ 2 

V27a R p P 


( 13 - 24 ) 


with terms defined on p. 613. 

a. Show that the distribution appears to be Gaussian by replotting the data 
in a form suggested by Eq. 13-24; i.e., InC vs. (z - R p ) 2 . 

b. What is the value of R py the projected range? What is the value for 
A R pJ the longitudinal straggle? 

c. What is the dose </>? 

14. An Ar primary ion beam of 1 keV energy is used to sputter-etch Cu 
during AES depth profiling. The ion current is 10“ 8 A, and the area 
sputtered is 0.5 cm x 0.5 cm. 

a. Predict the sputter rate of Cu in units of monolayers/min. Assume the 

o 

(100) surface is exposed, and the lattice parameter of Cu = 3.61 A. 

b. The beam energy is raised to 10 keV where the sputter yield is 6.25. 
Estimate the rate of Cu removal at a current of 10“ 6 A. 

15. Moseley’s law for K a X-ray emission lines suggests a correlation 
between X-ray energy E and atomic number Z; i.e., 

E = (3/4)Ry ch(Z- l) 2 , 

where Ry = Rydberg constant (Ry = 1.0974 X 10 5 cm'') 
c = speed of light 
h = Planck constant 
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a. Calculate the energy of the Ka line for Ti. 

b. Make a plot of \fE vs. Z (Moseley diagram) utilizing the data of Fig. 
6-16. Calculate the slope of the line. 

c. Relative to Ti how are the spectral lines of Fe positioned in the 
energy-level scheme? 
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Chapter 7 




Epitaxy 


7.1. Introduction 


Two ancient Greek words em (epi —placed or resting upon) and (taxis 

—arrangement) are the root of the modem word epitaxy , which describes an 
extremely important phenomenon exhibited by thin films. Epitaxy refers to 
extended single-crystal film formation on top of a crystalline substrate. It was 
probably first observed to occur in alkali halide crystals over a century ago, 
but the actual word epitaxy was apparently introduced into the literature by 
the French mineralogist L. Royer in 1928 (Ref. 1). For many years the 
phenomenon of epitaxy continued to be of scientific interest to numerous 
investigators employing vacuum evaporation, sputtering, and electrodeposi¬ 
tion. A sense of much of this early work on island growth systems (e.g., metal 
films on alkali halide substrates) was given in Chapter 5. Over the past two 
decades, epitaxy has left the laboratory and assumed crucial importance in 
solid-state device processing. Interest has centered on epitaxial films exhibiting 
layer growth. This chapter focuses primarily on such films as well as on 
several broader issues related to epitaxy. 

Two types of epitaxy can be distinguished and each has important scientific 
and technological implications. Homoepitaxy refers to the case where the 
film and substrate are the same material. Epitaxial (epi) Si deposited on Si 
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wafers is the most significant example of homoepitaxy. In fact, one of the first 
steps in the fabrication of bipolar and some MOS transistors is the CVD vapor 
phase epitaxy (VPE) of Si on Si (see Chapter 4). The reader may well ask why 
the underlying Si wafer is not sufficient; why must the single-crystal Si be 
extended by means of the epi film layer? The reason is that the epilayer is 
generally freer of defects, purer than the substrate, and can be doped indepen¬ 
dently of the wafer. A dramatic improvement in the yield of early bipolar 
transistors was the result of incorporating the epi-Si deposition step. The 
second type of epitaxy is known as heteroepitaxy and refers to films and 
substrates composed of different materials, e.g., AlAs deposited on GaAs. 
Heteroepitaxy is, of course, the more common phenomenon. Optoelectronic 
devices such as light-emitting diodes and lasers are based on compound 
semiconductor heteroepitaxial film structures. 

The differences between the two basic types of epitaxy are schematically 
illustrated in Fig. 7-1. When the epilayer and substrate crystal are identical, the 
lattice parameters are perfectly matched and there is no interfacial bond 
straining. In heteroepitaxy the lattice parameters are necessarily unmatched, 
and, depending on the extent of the mismatch, we can envision three distinct 
epitaxial regimes. If the lattice mismatch is very small, then the heterojunction 
interfacial structure is essentially like that for homoepitaxy. However, differ¬ 
ences in film and substrate chemistry and coefficient of thermal expansion can 
strongly influence the electronic properties and perfection of the interface. 
Small lattice mismatch is universally desired and actually achieved in a number 
of important applications through careful composition control of the materials 
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Figure 7-1. Schematic illustration of lattice-matched, strained, and relaxed heteroepi¬ 
taxial structures. Homoepitaxy is structurally very similar to lattice-matched heteroepi¬ 
taxy. 
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Figure 7-2. Cross-sectional model of a three-dimensional integrated circuit. 

involved. Section 7.4 deals with such epitaxial interfaces in compound semi¬ 
conductors and the devices based on them. 

When the film and substrate lattice parameters differ more substantially, we 
may imagine the other cases in Fig. 7-1. Either edge dislocation defects form 
at the interface, or the two lattices strain to accommodate their crystal¬ 
lographic differences. The former situation (relaxed epitaxy) generally prevails 
during later film formation stages irrespective of crystal structure or lattice 
parameter differences. The latter case is the basis of strained-layer heteroepi¬ 
taxy. This phenomenon occurs between film-substrate pairs composed of 
different materials that have the same crystal structure. Lattice parameter 
differences are an order of magnitude larger than in the case of lattice-matched 
heteroepitaxy. Structures consisting of Ge^Si,^ films grown on Si, currently 
under active research study, are important examples of strained layer epitaxy, 
and will be discussed further in Section 7.3. 

Recently, there has been a great deal of exciting research devoted to both the 
basic science of epitaxy and its engineering applications. One area has ad¬ 
dressed the dream of creating three-dimensional integrated circuits possessing 
intrinsically high device packing densities. Rather than a single level of 
processed devices, a vertical multifloor structure can be imagined with each 
level of devices separated from neighboring ones by insulating films. What is 
crucial is the ability to grow an epitaxial semiconductor film on top of an 
amorphous substrate—e.g., Si on Si0 2 as shown in Fig. 7-2 . This will require 
selective nucleation of epi Si at existing crystalline Si, and nowhere else, 
followed by lateral growth across surfaces that are ill suited to epitaxy. 
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Methods to achieve Si on insulator (SOI) epitaxy are mentioned in Section 7.5. 
A second area involves the fabrication of multilayer heterojunction composites. 
These remarkable epitaxial film structures include superlattices and quantum 
wells. Some of their simple properties together with applications involving 
incorporation into actual devices will be deferred until Chapter 14. The 
remainder of this chapter is divided into the following major sections: 

7.2. Structural Aspects of Epitaxial Films 

7.3. Lattice Misfit and Imperfections in Epitaxial Films 

7.4. Epitaxy of Compound Semiconductors 

7.5. Methods for Depositing Epitaxial Semiconductor Films 

7.6. Epitaxial Film Growth and Characterization 


7.2. Structural Aspects of Epitaxial Films 


7.2.1. Single-Crystal Surfaces 

Prior to consideration of epitaxial films, it is instructive to examine the nature 
of the topmost surface layers of a crystalline solid film. The reason the surface 
will generally have different properties than the interior of the film can be 
understood by a schematic cross-sectional view as shown in Fig. 7-3. If the 
surface structure is the predictable extension of the underlying lattice, we have 
the case shown in Fig. 7-3a. The loss of periodicity in one direction will tend 
to alter surface electronic properties and leave dangling bonds to promote 
chemical reactivity. It is more likely though that the structure shown in Fig. 
7-3b will prevail. The absence of bonding forces to underlying atoms results in 
new equilibrium positions that deviate from those in the bulk lattice. A 
disturbed surface layer known as the “selvedge 7 ’ may then be imagined. 
Within this layer the atoms relax in such a way as to preserve the symmetry of 
the bulk lattice parallel to the surface but not normal to it. One result of this, 
for example, could be a surface electric dipole moment in the selvedge. A 
more extreme structural disturbance is depicted in Fig. 7-3c. Here surface 
atoms rearrange into a structure with a symmetry that is quite different from 
the bulk solid. This phenomenon is known as reconstruction and can signifi¬ 
cantly alter many surface structure-sensitive properties, e.g., chemical, atomic 
vibrations, electrical, optical, and mechanical. 

Surface reconstruction is quite common on semiconductor surfaces; perhaps 
the most famous example occurs on the (111) surface of Si. A cut through the 
covalent bonds of the bulk (111) plane to create two exposed surfaces leaves 
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Figure 7-3. Schematic cross-sectional views of close-packed atomic positions at a 
solid surface: (a) bulk exposed plane; (b) atomic relaxation outward; (c) reconstruction 
of outer layers. (From Ref. 2, © Oxford University Press, by permission). 


covalent bonds dangling normal to the surface into the vacuum. Dangling 
bonds are energetically unfavorable, and the surface reduces its overall energy 
by reconstructing in a manner that reduces the number and/or energy of the 
bonds. 

A direct atomic image of the reconstructed surface of (111) Si, obtained by 
scanning tunneling microscopy (STM), is shown in Fig. 7-4. The Nobel Prize 
in physics was awarded to the developers of STM, G. Binnig and H. Rohrer, 
in 1986. The technique involves a highly controlled raster-fashion translation 

o 

of a metal tip possessing an extremely small radius of curvature (< 1000 A), 
over the atomic terrain of a surface. Because the tip is only tens of angstroms 
from the surface, a tunneling current inversely proportional to this distance, 
and varying directly with the topography of the surface atoms, flows. This 
signal is recorded and ultimately converted into an image. STM may be 
thought or as the quantum mechanical analog of the stylus method for 
measuring film thickness (see Section 6.2.3). 

Two other important techniques for analyzing the structure of crystalline 
surfaces and epitaxial films in particular are low-energy electron diffraction 
(LEED) and reflection high-energy electron diffraction (RHEED). Both will be 
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Figure 7-4. Scanning tunneling microscope image of the reconstructed (7 x 7) sur¬ 
face of (111) Si. (Courtesy of Y. Kuk, AT&T Bell Laboratories.) 


described in Section 7.6.2 in connection with MBE, but before that we must 
attend to the two-dimensional geometric arrangements of atoms on a crystalline 
surface and the notation used to identify them. 


7.2.2. Surface Crystallography 

Just as there are 14 Bravais lattices in three dimensions, so there are just five 
unit meshes or nets, corresponding to a two-dimensional surface as shown in 
Fig. 7-5. Points representing atoms may be arranged to outline (1) squares, (2) 
rectangles, (3) centered rectangles, (4) hexagons and (5) arbitrary parallelo¬ 
grams. Miller-type indices are used to denote atom coordinates, directions, and 
distances between lines within the surface. 
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SQUARE RECTANGULAR 



7 = 60° y is arbitrary 

Figure 7-5. The five diperiodic surface nets. 


Consider the mesh of substrate atoms in Fig. 7-6a with an array of adatoms 
situated on the surface as indicated. This combination could, for example, 
correspond to the early growth of an epitaxial layer on the (100) plane of a 
BCC crystalline substrate surface. The adsorbate atoms (shaded in) form a 
rectangular monolayer lattice or overgrowth above the substrate atom positions 
(dots). Unit dimension vectors describing the monolayer lattice (b f ) are simply 
related to those of the substrate lattice (a,). In the x direction b y = 2a x , and 
in the y direction b 2 = 1 a 2 . We, therefore, speak of a P(2 x 1) overlayer 
where P indicates that the unit cell is primitive. Similarly, for an overlayer of 
the same geometry but oriented at 90° with respect to the first case, the 
notation is P(1 X 2). Other examples are shown in Fig. 7-6b, c. Note that in 
Fig. 7-6c the overgrowth layer is rotated with respect to the substrate coordi¬ 
nates and is identified by the letter R . Similarly, C is used to denote the 
centered lattice. 

It is the geometry of the reciprocal lattice, however, and not that of the real 
lattice, that appears as the visible image in diffraction patterns. As the name 
implies, the reciprocal lattice has dimensions and features that are “inverse” 
to those of the real lattice. Long dimensions in real space appear at right angles 
and are shortened in inverse proportion, within the reciprocal space. The 
relation between a, and a 2 and the reciprocal vectors a* and a\ of the unit 


mesh in the reciprocal lattice, is expressed by 


Q\ ' a* = 2 tc& u , 

(7-1) 

Similarly, 


b r b* = 2 t 6 u . 

(7-2) 
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Figure 7-6. (a) Atomic positions of adatoms (shaded) relative to (100) substrate or 

bulk atoms (dots). Corresponding diffraction patterns are shown on the right. (X refers 
to adatom lattice.) (b) (100) C(2 x 2) surface structure, (c) (100) R45° (\/l x 2 \fl) 
surface structure. 


where b^ = 0 if / =£ y and = 1 if / = j. Reciprocal lattices corresponding 
to the real-space structures of Fig. 7-6a are sketched intuitively in the same 
figure. The scale is arbitrary, but the lattice periodicity and symmetry are 
preserved. As a final example, consider what happens when two orientations, 
P {2 X 1) and P(1 X 2), are admixed in roughly equal proportions. The 
reciprocal lattices from each simply superimpose. It is left as an exercise for 
the reader to sketch the resultant pattern. 


7.2.3. Epitaxial Interface Crystallography 

To address issues dealing with the structure of epitaxial interfaces it is first 
necessary to identify the crystallographic orientation relationships between the 
film and the substrate. Unlike the notation used to describe the two-dimen¬ 
sional surfaces of the previous section, the traditional (3-D) Miller indices are 
employed here. For this purpose, the indices of the overgrowth plane are 
written as ( HKL ), and those of the parallel substrate plane at the common 
interface are taken as (hkl). The corresponding parallel directions in the 
overgrowth and substrate planes, denoted by [UVW] and [wuw], respec¬ 
tively, must also be specified. This tetrad of indices written by convention as 
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( HKL ) II (hkl); [UVW] II [uvw] serves to define the epitaxial geometry. For 
example, in the case of parallel epitaxy of Ni on cleaved NaCl, the notation 
would read (001) Ni II (001) NaCl; [100] Ni II [100] NaCl. In this case both 
planes and directions coincide. For (111) PbTell(lll) MgAl 2 0 4 ; [211] 
PbTe II [TOl] MgAl 2 0 4 , the interfacial plane is common, but the directions are 
not. Frequently, the epitaxial relationships can be predicted on the basis of 
lattice-fitting arguments. Those planes and directions that give the best lattice 
fit determine the orientation of the film with respect to the substrate. 

As an example, consider the growth of a (110) Fe (BCC) film on a (110) 
GaAs (zinc blende) substrate. The unit cell lattice parameters for Fe and GaAs 
are 2.866 A and 5.653 A, respectively, suggesting that two Fe cells could be 
accommodated by one of GaAs. A view of the (110) plane of these materials is 
shown in Fig. 7-7. The resulting epitaxial geometry is denoted by (110) 
Fe II (110) GaAs; [200] Fe || [100] GaAs. 

An important quantity that characterizes epitaxy is the lattice misfit, /, of 
the film defined as 


/ = (a 0 (s) - a 0 (f))/a 0 (f) = A a 0 /a 0 (7-3) 


where a 0 (f) and a 0 (.s) Te ^ tT t0 the unstrained lattice parameters of film and 
substrate, respectively. A positive / implies that the initial layers of the 
epitaxial film will be stretched in tension, and a negative / means film 
compression. In the Fe-GaAs system the misfit in the [001] direction is 


a 0 (GaAs) - 2a 0 (Fe) 
2o 0 (Fe) 


5.653 - 2(2.866) 

2 ( 2 . 866 ) 


-0.0138. 


or 1.38%. High-quality epitaxial Fe films have been deposited on GaAs, 
facilitating fundamental studies of ferromagnetism (Ref. 3). 


[ 001 ] [ 111 ] 
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-[ 110 ] 
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(110) Fe 
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a. = 5.653A 


a.= 2.866A 


Figure 7-7. Surface nets of (110) planes of GaAs and a-Fe. (From Ref. 3). 
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7.3. Lattice Misfit and Imperfections in Epitaxial Films 


7.3.1. Lattice Misfit 

In this section we explore some implications of lattice misfit on the perfection 
of epitaxial films. The basic theory that accounts for the elastic-plastic changes 
in the bilayer was introduced by Frank and van der Merwe (Ref. 4). It attempts 
to account for the accommodation of misfit between two lattices rather than 
being a theory of epitaxy per se. The theory predicts that any epitaxial layer 
having a lattice parameter mismatch with the substrate of less than ~ 9% 
would grow pseudomorphically; i.e., for very thin films the deposit would be 
elastically strained to have the same interatomic spacing as the substrate. The 
interface would, therefore, be coherent with atoms on either side lining up. 
With increasing film thickness, the total elastic strain energy increases, eventu¬ 
ally exceeding the energy associated with a relaxed structure consisting of an 
array of so-called misfit dislocations separating wide regions of relatively good 
fit. At this point, the initially strained film would ideally decompose to this 
relaxed structure where the generated dislocations relieve a portion of the 
misfit. As the film continues to grow, more misfit is relieved until at infinite 
thickness the elastic strain is totally eliminated. In the case of epitaxial growth 
without interdiffuson, pseudomorphism exists only up to some critical film 
thickness d c , beyond which misfit dislocations are introduced. Misfit disloca¬ 
tions lie in planes parallel to the interface, and they can be observed by 
transmission electron microscopy. For example, early epitaxial growth of 
CoSi 2 films on Si is accompanied by the honeycomb array of misfit disloca¬ 
tions shown in the TEM image of Fig. 7-8. 

Following Matthews (Refs. 5, 6), an expression for d c can be calculated by 
minimizing the sum of the elastic strain energy E f (per unit area) and 
dislocation energy E d (per unit area) with respect to the film strain e f . 
Assuming that the film and substrate shear moduli, fx, are the same, we have 
approximate expressions for E c and E d \ 


2#i(l + v) 

E e = £}d, 

1 — V 

(7-4) 


(7-5) 


where d is the film thickness, v is Poisson’s ratio, b is the dislocation Burgers 
vector, and R Q is a radius about the dislocation where the strain field 
terminates. Physically these equations indicate that strain energy is a volume 
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Figure 7-8. Misfit dislocations arising from reaction between 70 A Co and Si at 880 
°C, in ultrahigh vacuum. (TEM (weak beam 220) image courtesy of J. M. Gibson, 
AT&T Bell Laboratories.) 


energy that increases linearly with film thickness. In contrast, dislocation 
energy is nearly constant with only a weak dependence on d arising from R 0 . 
Therefore, at some value of d , dislocations are favored. Taking the derivative 
of £\. + E d with respect to Ej and setting it equal to zero gives for the critical 
strain a value 


b 

€* f = -:-—In 

8 7T (1 + v)d 



(7-6) 


The largest possible value for s* is /. If the value of e* predicted by Eq. 7-6 
is larger than /, the film will strain to match the substrate exactly, in which 
case ef will equal /, and E d will be zero. If ef <f, a portion of the misfit 
equal to / — ef will be accommodated by misfit dislocations. By assuming 
ef ~ / at d = d c and that R 0 ~ d c , the critical film thickness prior to misfit 
dislocation formation is expressed by 

/ ' In f — + l). 

8 7T (1 + v)f \ b ) 


d, 


(7-7) 
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0 0,2 0,4 0.6 0.8 1.0 

GERMANIUM FRACTION x 

Figure 7-9. Experimentally determined limits for defect-free strained-layer epitaxy 
of Ge x Si t _ x on Si. Note that / is proportional to Ge fraction. (From Ref. 9). 

In the region where d c is approximately a few thousand angstroms, d c ^ b/2 f. 
This means that the film will be pseudomorphic until the accumulated misfit 
d c f exceeds about half the unit cell dimension or b/2. 

The validity of these ideas has been critically tested on several occasions in 
Si-base materials. By doping Si wafers with varying amounts of boron, whose 
atomic size is smaller than that of Si, the lattice parameter of the substrate can 
be controllably reduced. This affords the opportunity to study defect generation 
in subsequently deposited epitaxial films under conditions of very small lattice 
mismatch. As expected, an increase in / resulted in an increase in misfit 
dislocation density (Ref. 7). 

More recently, experimentation on Ge^Sij .^-Si epitaxial films has ex¬ 
tended a test of the theory into the regime of large lattice misfits (Ref. 8). The 
results are shown in Fig. 7-9 where regions of lattice-strained but defect-free 
(commensurate) epitaxy are distinguished from those of dislocation-relaxed 
(discommensurate) epitaxy. Nature is kinder to us than the Matthews theory 
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would suggest, and considerably thicker films than d c predicted by Eq. 7-7 
can be deposited in practice. The reason is that Ge^Si^ strained-layer films 
are not at equilibrium. Extended dislocation arrays do not form instantaneously 
with well-defined spacings as assumed; rather, dislocations nucleate individu¬ 
ally over an area determined by a width w and unit depth, over which atoms 
above and below the slip plane are displaced by at least b/2. For isolated 
screw dislocations the energy per unit length is equal to fib 2 /4w In R 0 /b. 
When formed in a film of thickness d, its area energy density is then given by 


fib 2 d 

Ed= 


(7-8) 


By equating this to E s (Eq. 7-4), the energy supplied for nucleation, Bean 
(Ref. 8) has shown that for e f = / 


( 1 -")fr 2 ln K\ 

(l + v)$irwf 2 \ b J 


(7-9) 


The solid line of Fig. 7-9 represents the excellent fit of this equation to the 
experimental data for the case where w is arbitrarily chosen to be five [110] 
lattice spacings or — 19.6 A. 

When the lattice misfit becomes large, the spacing between misfit disloca¬ 
tions decreases to the order of only a few lattice spacings. In such a case, 
natural lattice misfit theory breaks down. There are no longer large areas of 
good fit separated by narrow regions of poor fit. Rather, poor fit occurs 
everywhere. 


7.3.2. Sources of Defects in Epitaxial Films 

The issues just raised with respect to misfit dislocations are but a part of a 
larger concern for defects in epitaxial films. In semiconductors it is well 
known that such defects as grain boundaries, dislocations, twins, and stacking 
faults degrade many device properties by altering carrier concentrations and 
mobilities. By acting as nonradiative recombination centers, they serve to 
reduce the minority carrier lifetime and quantum efficiency of photonic de¬ 
vices. For these reasons a very considerable effort has been devoted to the 
study of defects and methods for their elimination. Stringfellow (Ref. 7) has 
divided the sources of defects in epitaxial films into five categories, and it is 
instructive to consider them. 


7.3.2.7. Propagation of Defects from the Substrate into the Epitaxial 
Layer . A classic example of defect propagation from the substrate to the 
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Figure 7-10. Schematic composition of crystal defects in eptiaxial films: (1) thread¬ 
ing edge dislocations; (2) interfacial misfit dislocations; (3) threading screw dislocation; 
(4) growth spiral; (5) stacking fault in film; (6) stacking fault in substrate; (7) oval 
defect; (8) hillock; (9) precipitate or void. 

epitaxial layer results in the extension of an emergent screw dislocation spiral 
from the substrate surface into the growing film, as shown schematically in 
Fig. 7-10. Depositing atoms preferentially seek the accommodating ledge sites 
of the dislocation spiral staircase. Layers of defect-free film then radiate 
laterally to cover the substrate. Except for the threading screw dislocation, an 
epitaxial layer of otherwise good fit is possible. Today’s semiconductor 
substrates, including Si and GaAs, are largely “dislocation-free.” Occasional 
substrate dislocations that are present are apparently a source of dislocations 
observed in homoepitaxial layers. Gross defects such as grain boundaries and 
twins are, however, never present. 


7.3.2.2. Stacking Faults . Stacking faults are crystallographic defects in 
which the proper order of stacking planes is interrupted. For example, 
consider the first three atomic planes or layers of a (111) silicon film. Each of 
these planes may be imagined to be a close-packed array of atomic spheres 
(Fig. l-2a), and each successive layer fits into the interstices of the previous 
layer. Atoms in the second layer (B) have no choice but to nest in one set of 
interstices of the first (A) layer. It now makes a difference which set of B layer 
interstices that atoms of the third layer choose to lie in. If they do not lie above 
the A or B layer atoms, the stacking sequence is ABC, and in a perfect 
epitaxial film the ABCABCABC, etc., order is preserved. If, however, a plane 
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of atoms is missing from the normal sequence, e.g., ACABCABC, or a plane 
of atoms has been inserted, e.g., ABCBABCABC, then stacking fault defects 
are produced. It is established that they propagate from dislocations and oxide 
precipitates at the substrate interface. Misoriented clusters, or nuclei contain¬ 
ing stacking faults, coalesce with normal nuclei and grow into the film in the 
manner of an inverted pyramid. Continuing growth causes the characteristic 
closed triangle shown in Fig. 7-10 to become progressively larger. For (100) 
growth the stacking faults form squares. Appropriate etches are required to 
reveal these defects. In general, the stacking-fault density of homoepitaxial Si 
films increases with decreasing growth temperature. In heteroepitaxial films, 
e.g., GaAs on Si, stacking faults are very common. 

Related to stacking faults are the ubiquitous “oval defects” observed during 
MBE growth (Section 7.5.4) of compound semiconductors. These defects 
shown schematically in Fig. 7-10 are faceted growth hillocks that nucleate at 
the film-substrate interface and nest within the epitaxial layer. They usually 
contain a polycrystalline core bounded by four {111} stacking-fault planes. 
With densities as high as 1000 cm" 2 or more and sizes ranging from 1 /xm 2 to 
- 30 /xm 2 , these defects are a source of great concern. There are a number of 
possible sources for oval defects in GaAs, including carbon contamination of 
the substrate, incomplete desorption of oxides prior to growth, and spitting of 
Ga and Ga 2 0 from melts. 


7.3.2.3. Formation of Precipitates or Disiocation Loops due to Super¬ 
saturation of impurities or Dopants. This category is self-explanatory. 
The precipitates and dislocations usually form during cooling and are the result 
of solid-state reactions subsequent to growth. Films containing high intentional 
or accidental dopant or impurity levels are susceptible to such defects. 


7.3.2.4. Formation of Low-Angle Grain Boundaries and Twins. Low- 
angle grain boundaries and twins arise from misoriented islands that meet and 
coalesce. When this happens, small-angle grain boundaries or crystallographic 
twins result. The lattice stacking is effectively mirrored across a twin plane A, 
i.e., . . . CABCACBAC .... Both types of defects may anneal out by disloca¬ 
tion motion if the temperature is high enough. During heteroepitaxial growth, 
Matthews (Ref. 5) has suggested that there is some ambiguity in the exact 
orientation of small nuclei. He has formulated a rule of thumb that the 
relaxation of elastic misfit strain causes a variation in the orientation of crystal 
planes (in radians) roughly equal to the magnitude of the lattice misfit /. Such 
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an effect would naturally give rise to a network of small-angle grain bound¬ 
aries. 


7.3.2.S. Formation of Misfit Dislocations. Formation of misfit disloca¬ 
tions has been dealt with in the previous section. The mechanisms by which 
misfit dislocations form is of interest. Although misfit dislocations lying in the 
interface between the overgrowth and substrate are the most efficient means of 
relaxing misfit strain, they are not the only dislocations present. Extending 
from the substrate and into the epitaxial film (much like the screw dislocation 
of Fig. 1-6) are so-called threading dislocations. Under the influence of lattice 
strain, the vertical segments in the substrate and film move in opposite 
directions, leaving a segment of misfit dislocation lying in the plane of the 
interface (Fig. 7-10). Therefore, the initial density of threading dislocations 
should correlate with the final density of misfit dislocations, and this has 
indeed been observed. Matthews has estimated that misfits of more than 1-2% 
are necessary to nucleate a typical misfit dislocation, and this is also consistent 
with experimental observation. 

In closing, it is appropriate to comment on the perfection of epitaxial layers. 
Various levels of perfection can be imagined, depending on methods used to 
prepare and evaluate films. Early epitaxial semiconductor films were judged to 
be single crystals based on standard X-ray and electron diffraction techniques 
that are relatively insensitive to slight crystal misorientations. However, the 
subsequent electrical characterization of these films yielded significantly poorer 
electrical properties than anything imaginable in bulk melt-grown single crys¬ 
tals. In general, thinned slices of bulk crystals were more structurally perfect 
than epitaxial films of equivalent thickness. This is certainly true of epitaxial 
films exhibiting island and S.K. growth and most of the films reported in the 
older literature. However, some of today's lattice-matched MBE films, grown 
under exacting conditions, are indeed structurally perfect when judged by the 
unambiguous standard of high-resolution TEM lattice imaging. In fact, the 
crystalline quality now frequently exceeds that attainable in bulk form. 


7.4. Epitaxy of Compound Semiconductors 


7.4.1. Introduction 

Compound semiconductor films have been grown epitaxially on single-crystal 
insulators (e.g., A1 2 0 3 , CaF 2 ) and semiconductor substrates. The latter case 
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Figure 7-11. Conventional AlGaAs-GaAs double-heterostructure injection laser 
structure (not to scale.) 


has attracted the overwhelming bulk of the attention and, unless indicated 
otherwise, will be the only systems discussed. The epitaxy of heterojunctions 
has been crucial in the exploitation of compound semiconductors for optoelec¬ 
tronic device applications. The term heterojunction refers to the interface 
between two single-crystal semiconductors of differing composition and doping 
level brought into contact. There have been two main thrusts with regard to the 
epitaxy of heterojunction structures. In the first, a single or generally limited 
number of different junctions is involved. The intent is to grow suitable binary, 
ternary, or even quaternary compound layers epitaxially on top of a similar 
compound substrate, or vice versa. The most common example is the 
Al^Ga^As-GaAs combination. As we shall see, good epitaxy is ensured 
because the two lattices are very well matched to each other (i.e., low misfit), 
even when the atom fraction x of A1 substituted for Ga is high. From a device 
standpoint, it is significant that the energy band gaps of these materials are 
different; i.e., GaAs has narrower band gap than any of the Al^Ga^As 
compounds. This means that charge carriers will be confined to the low-en¬ 
ergy-gap GaAs film when clad by the wider-gap Al^Ga^As heterojunction 
barriers. This makes carrier population inversion and laser action possible. 
Such layered structures are utilized in devices such as lasers (Fig. 7-11) and 
light-emitting diodes (Fig. 7-12), which have served as light sources in fiber 
optical communication systems. Our scope does not allow for discussion of the 
operation of these or other thin-film optoelectronic devices. For more infor¬ 
mation the reader is referred to the many excellent textbooks on solid-state 
devices (Refs. 10, 11). The second thrust dealing with superlattices will be 
addressed in Section 14.7, 
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Figure 7-12. Schematic of a surface emitting InGaAsP (X = 1.3 /im) light-emitting 
diode. (Courtesy N. K. Dutta, AT&T Bell Laboratories.) 

7.4.2. Compound Semiconductor Materials 

Materials employed for epitaxial optoelectronic devices have been drawn 
largely from a collection of direct-band-gap III-V semiconductors. Although 
the discussion will be primarily limited to them, the results are applicable to 
other materials as well (e.g., II-VI compounds). Table 7-1 contains a list of 
important semiconductors together with some physical properties pertinent to 
epitaxy. When light is emitted from or absorbed in a semiconductor, energy as 
well as momentum must be conserved. In a direct band-gap semiconductor, 
the carrier transitions between the valence and conduction bands occur without 
change in momentum of the two states involved. In the energy-momentum or 
equivalent energy-wave vector, parabola-like (E vs. k) representation of 
semiconductor bands (Ref. 10) (Fig. 7-13), emission of light occurs by a 
vertical electron descent from the minimum conduction-band energy level to 
the maximum vacant level in the valence band. This is what occurs in the 
direct band-gap materials GaAs and InP. However, in indirect band-gap 
semiconductors like Ge and Si, the transition occurs with a change in 
momentum that is essentially accommodated by excitation of lattice vibrations 
and heating of the lattice. This makes direct electron-hole recombination with 
photon emission unlikely. But in direct-band-gap semiconductors, such pro¬ 
cesses are more probable, making them far more efficient (by orders of 
magnitude) light emitters. 

Another implication of the distinction between direct and indirect semicon¬ 
ductors is the variation of the absorption coefficient ( a) as a function of photon 
energy, as shown in Fig. 7-14. The ratio of the photon intensity at a depth x 
below the surface, I(x ), to that incident on it, / 0 , is described by 

/(*) = / 0 exp - ax. 



(7-10) 
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Table 7-1. Semiconductor Properties (25 °C) 


Material 

Lattice 

Parameter 

o 

(A) 

Melting 

Point 

<K) 

Dissociation 

Pressure 

(Atm) 

Thermal 
Expansion 
Coefficient 
,<)-<• « c -i 

Energy Gap 

(eV at 25 °C) 

Electron 

Mobility 

(cm 2 /V-s) 

Diamond 

3.560 

- 4300 


1.0 

5.4 

1800 

Si 

5.431 

1685 


2.33 

1.121 

1350 

Ge 

5.657 

1231 


5.75 

0.681 

3600 

ZnS 

5.409 

3200 


7.3 

3.54 

120 

ZnSe 

5.669 

1790 


7.0 

2.58 

530 

ZnTe 

6.101 

1568 


8.2 

2.26 

530 

CdTe 

6.477 

1365 


5.0 

1.44 

700 

HgTe 

6.460 

943 


1.9 

~ 0,15 


CdS 




4.0 

2.42D 

340 

AlAs 

5.661 

1870 

1.4 

5.2 

2,161 

o 

00 

r^i 

AlSb 

6.136 

1330 

< 10 

3.7 

1.601 

900 

GaP 

5.451 

1750 

35 

5.3 

2.241 

300 

GaAs 

5.653 

1510 

1 

5.8 

1.43D 

- 6500 

GaSb 

6.095 

980 

< 10" 3 

6.9 

0.67D 

5000 

InP 

5.869 

1338 

25 

4.5 

1.27D 

4500 

InAs 

6.068 

1215 

0.3 

4.5 

0.36D 

30000 

InSb 

6.479 

796 

< 10" 3 

4.9 

0.165D 

80000 


D - direct, I = indirect. 
From Refs. 11, 12. 


In all semiconductors, a becomes negligible once the wavelength exceeds the 
cutoff wavelength. This critical wavelength \ c is related to the band-gap 
energy E g by the well-known relation E g - hc/\ c or \ c (pirn) = \.24/E g 
(eV). For direct-band-gap semiconductors the value of a becomes large on the 
short-wavelength side of , signifying that light is absorbed very close to the 
surface. For this reason even thin-film layers of GaAs are adequate, for 
example, in solar cell applications. In Si, on the other hand, a varies more 
gradually with wavelength less than X c because of the necessity for phonon 
participation in light absorption-carrier generation processes. Therefore, effi¬ 
cient solar cell action necessitates thicker layers if indirect semiconductors are 
employed. 

Device applications require, in addition to a direct-band-gap semiconductor, 
a specific wavelength or value of E g at which emission or absorption pro¬ 
cesses are optimized. A further necessity is close matching of the lattice 
parameters (a 0 ) of the heterostructure layers to ensure defect-free interfaces. 
An extremely handy graphical representation of E g and a 0 for elemental and 
major III-V compound semiconductors and their alloys is shown in Fig. 7-15. 
Through its use, the design and selection of complex semiconductor alloys with 
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Figure 7-13. Model of electron transitions between conduction and valence bands in 
(a) direct- and (b) indirect band-gap semiconductors. 

the desired properties may be visualized. Elements and binary compounds are 
represented simply as points. Ternary alloys are denoted by lines between 
constituent binary compounds. When one of the elements is common to both 
compounds, a continuous range of solid-solution ternary alloys form when the 
binaries are alloyed. Thus, the line between InP and InAs represents the 
collection of InAs^P,^ ternary solution alloys, with x dependent on the 
proportions mixed. Within the areas outlined by four binary compounds are the 
quaternary alloys. Therefore, the Ga, .^In ^As, _ y P y system may be thought of 
as arising from suitable combinations of GaAs, GaP, InAs, and InP. There is 
no need though to start with these four binary compounds when synthesizing a 
quaternary; it is usually only necessary to control the vapor pressures of the 
Ga-, In-, As-, and P-bearing species. The solid lines represent the direct-band- 
gap ternary compounds, and the dashed lines refer to materials with an indirect 
band gap. 

An example will illustrate the use of this important diagram. We assume for 
simplicity that a linear law of mixtures governs both the resultant lattice 
constant and energy-gap values. Consider GaAs and AlAs with respective a {} 

o o 

values of 5.6537 A and 5.6611 A. No hint of structural defects at the interface 
between epitaxial films of these compounds is observed in the lattice image of 
Fig. 7-16. The lattice mismatch between these compounds is Aa 0 /a 0 or 
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Figure 7-14. Optical absorption coefficients for various semiconductor materials. 
(Reprinted with permission from J. Wiley and Sons, S\ M. Sze, Semiconductor Devices- 
Physics and Technology , 1985). 


0.14%. This is also the maximum difference between any GaAs-Al v Ga, _ x As 
heterojunction combination. However, for high device reliability a maximum 
mismatch of 0.1 % is desired. Assuming a Al 04 Ga^ 6 As cladding layer compo¬ 
sition in Fig. 7-11, a 0 is calculated to be 5.6563 A. The mismatch with GaAs 
is therefore 0.056% or 0.4 of the maximum mismatch. Similarly, linear 
interpolation between the energy gaps £^(GaAs) = 1.435 eV and /^(AIAs) = 
2.16 eV yields a value of £ (Al 0 4 Ga 0 6 As) = 1.73 eV. Vegard's law express¬ 
ing a linear variation of a 0 with composition is a good assumption for these 
alloys. However, E H values can only be linearly interpolated between direct-gap 
materials and AlAs is an indirect-gap semiconductor. For Al^Ga^^As it has 
been found (Ref. 14) that 

E g ( x) = 1 .424 -F 1.44 x eV ; 


x < 0.45. 


(7-11) 
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7-15. Energy gaps and corresponding lattice constants for various compound semiconductors. (Courtesy 
of P. K. Tien, AT&T Bell Laboratories.) 
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Figure 7-16. Electron microscope lattice image of GaAs-AJAs heterojunction taken 
with [100] illumination. (From Ref. 13). (Courtesy of JOEL USA, Inc.) 

Therefore, the correct value for £^(0.4) = 2.00 eV. In addition, the index of 
refraction n , required for light-guiding properties, varies as (Ref. 15) 

n(x) = 3.590 - 0.710* + 0.091* 2 . (7-12) 

In summary, it is possible to design ternary alloys with E g larger than GaAs, 
with n smaller than GaAs, while maintaining an acceptable lattice match for 
high-quality heterojunctions. This unique combination of properties has led to 
the development of a family of injection lasers, light-emitting diodes, and 
photodetectors based on the GaAs-AlAs system. 

7.4.3. Additional Applications 

7.4.3. 1. Optical Communications . Optical communication systems are 
used to transmit information optically. This is done by converting the initial 
electronic signals into light pulses using laser or light-emitting diode light 
sources. The light is launched at one end of an optical fiber that may extend 
over long distances (e.g., 40 km). At the other end of the system, the light 
pulses are detected by photodiodes or phototransistors and converted back into 
electronic signals that, in telephone applications, finally generate sound. In 
such a system it is crucial to transmit the light with minimum attenuation or 
low optical loss. Great efforts have been made to use the lowest-loss fiber 
possible and minimize loss at the source and detector ends. If optical losses are 
high, it means that the optical signals must be reamplified and that additional, 
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costly repeater stations will be necessary. The magnitude of the problem can be 
appreciated when transoceanic communications systems are involved. In 
silica-based fibers it has been found that minimum transmission losses occur 
with light of approximately 1.3-1.5 fim wavelength. The necessity to operate 
within this infrared wavelength window bears directly on the choice of suitable 
semiconductors and epitaxial deposition technology required to fabricate the 
required sources and detectors. 

Reference to Table 7-1 shows that InP is transparent to 1.3-jtm light, and 
this simplifies the coupling of fibers to devices. A very close lattice match to 
InP ( a 0 = 5.869 A) can be effected by alloying GaAs and In As. Through the 
use of Vegard’s law, it is easily shown that the necessary composition is 
Ga 047 In 053 As. In the same vein, high-performance lasers based on the 
lattice-matched GalnAsP-InP system have recently emerged for optical com¬ 
munications use. 


7.4.Z.2. Silicon Heteroepitaxy (Ref. 8). Since the early 1960s, Si has been 
the semiconductor of choice. Its dominance cannot, however, be attributed 
solely to its electronic properties for it has mediocre carrier mobilities and only 
average breakdown voltage and carrier saturation velocities. The absence of a 
direct band gap rules out light emission and severely limits its efficiency as a 
photodetector. Silicon does, however, possess excellent mechanical and chemi¬ 
cal properties. The high modulus of elasticity and high hardness enable Si 
wafers to withstand the rigors of handling and device processing. Its great 
natural abundance, the ability to readily purify it and the fact that it possesses a 
highly inert and passivating oxide have all helped to secure the dominant role 
for Si in solid-state technology. Nevertheless, Si is being increasingly sup¬ 
planted in high-speed and optical applications by compound semiconductors. 

The idea of combining semiconductors that can be epitaxially grown on 
low-cost Si wafers is very attractive. Monolithic integration of HI-V devices 
with Si-integrated circuits offers the advantages of higher-speed signal process¬ 
ing distributed over larger substrate areas. Furthermore, Si wafers are more 
robust and dissipate heat more rapidly than GaAs wafers. Unfortunately, there 
are severe crystallographic, as well as chemical compatibility problems that 
limit Si-based heteroepitaxy. From data in Table 7-1, it is evident that Si is 
only closely lattice matched to GaP and ZnS. Furthermore, its small lattice 
constant limits the possible epitaxial matching to semiconductor alloys. Never¬ 
theless, high-quality, lattice-mismatched (strained-layer) heterostructures of 
AlGaAs-Si and Ge^S^_*-Si have been prepared and show much promise for 
new device applications. 
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7.4.3.3. Epitaxy in ll-VI Compounds (Ref. 16). Semiconductors based on 
elements from the second (e.g., Cd, Zn, Hg) and sixth (e.g., S, Se, Te) 
columns of the periodic table display a rich array of potentially exploitable 
properties. They have direct energy band gaps ranging from a fraction of an 
electron volt in Hg compounds to over 3.5 eV in ZnS, and low-temperature 
carrier mobilities approaching 10 6 cm 2 /V-sec are available. Interest in the 
wide-gap II-VI compounds has been stimulated by the need for electronically 
addressable flat-panel display devices and for the development of LED and 
injection lasers operating in the blue portion of the visible spectrum. For these 
purposes, ZnSe and ZnS have long been the favored candidates. When the 
group II element is substituted by a magnetic transition ion such as Mn, new 
classes of materials known as diluted magnetic or semimagnetic semiconduc¬ 
tors result. Examples are Cd(Mn)Te or Zn(Mn)Se, and these largely retain the 
semiconducting properties of the pure compound. But the five electrons in the 
unfilled 3d shell of Mn give rise to localized magnetic moments. As a result, 
large magneto-optical effects (e.g., Zeeman splitting in magnetic fields, Fara¬ 
day rotation, etc.) occur and have been exploited in optical isolator devices. 
For this, as well as other potential applications in integrated optics, high-qual¬ 
ity epitaxial films are essential. 


7.5. Methods for Depositing Epitaxial Semiconductor Films 


7.5.1. Liquid Phase Epitaxy 

In this section an account of the processes used to deposit epitaxial semicon¬ 
ductor films is given. We start with LPE, a process in which melts rather than 
vapors are in contact with the growing films. Introduced in the early 1960s, 
LPE is still used to produce heterojunction devices. However, for greater layer 
uniformity and atomic abruptness, it has been supplanted by CVD and MBE 
techniques. LPE involves the precipitation of a crystalline film from a super¬ 
saturated melt onto the parent substrate, which serves as both the template for 
epitaxy and the physical support for the heterostructure. The process can be 
understood by referring to the GaAs binary-phase diagram on p. 31. Consider 
a Ga-rich melt containing 10 at% As. When heated above 950°C, all of the As 
dissolves. If the melt is cooled below the liquidus temperature into the 
two-phase field, it becomes supersaturated with respect to As. Only a melt of 
lower than the original As content can now be in equilibrium with GaAs. The 
excess As is, therefore, rejected from solution in the form of GaAs that grows 
epitaxially on a suitably placed substrate. Many readers will appreciate that the 
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crystals they grew as children from supersaturated aqueous solutions essen¬ 
tially formed by this mechanism. 

Through control of the cooling rates, different kinetics of layer growth 
apply. For example, the melt temperature can either be lowered continuously 
together with the substrate (equilibrium cooling) or separately reduced some 
5-20 °C and then brought into contact with the substrate at the lower 
temperature (step cooling). Theory backed by experiment has demonstrated 
that the epitaxial layer thickness increases with time as / 3/2 for equilibrium 
cooling and as / ,/2 for step cooling (Ref. 10). Correspondingly, the growth 
rates or time derivatives vary as t 1/2 and / _I/2 , respectively. These diffusion- 
controlled kinetics respectively indicate either an increasing or decreasing film 
growth rate with time depending on mechanism. Typical growth rates range 
from - 0.1 to 1 jun/min. A detailed analysis of LPE is extremely compli¬ 
cated in ternary systems because it requires knowledge of the thermodynamic 
equilibria between solid and solutions, nucleation and interface attachment 


FUSED-SILICA 
FURNACE TUBE 



Figure 7-17. Schematic of LPE reactor. (Courtesy of M. B. Panish, AT&T Bell 
Laboratories.) 
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kinetics, solute partitioning, diffusion, and heat transfer. LPE offers several 
advantages over other epitaxial deposition methods, including low-cost appara¬ 
tus capable of yielding films of controlled composition and thickness, with 
lower dislocation densities than the parent substrates. 

To grow multiple GaAs-AlGaAs heterostructures, one translates the seed 
substrate sequentially past a series of crucibles holding melts containing 
various amounts of Ga and As together with such dopants as Zn, Ge, Sn, and 
Se as shown in Fig. 7-17. Each film grown requires a separate melt. Growth is 
typically carried out at temperatures of — 800 °C with maximum cooling rates 
of a few degrees Celsius per minute. Limitations of LPE growth include poor 
thickness uniformity and rough surface morphology particularly in thin layers. 
The CVD and MBE techniques are distinctly superior to LPE in these regards. 

7.5.2. Seeded Lateral Epitaxial Film Growth over Insulators 

The methods we describe here briefly have been successfully implemented in 
Si but not in GaAs or other compound semiconductors. The use of melts 
suggests the inclusion of this subject at this point. Technological needs for 
three-dimensional VLSI and isolation of high-voltage devices have spurred the 
development of techniques to grow epitaxial Si layers over such insulators as 
Si0 2 or sapphire. In the recently proposed LEGO (lateral epitaxial growth 
over oxide) process (Ref. 17), the intent is to form isolated tubs of high-quality 
Si surrounded on all sides by a moat of Si0 2 . Devices fabricated within the 
tubs require the electrical insulation provided by the Si0 2 . As a result they are 
also radiation-hardened or immune from radiation-induced charge effects 
originating in the underlying bulk substrate. The process shown schematically 
in Fig. 7-18 starts with patterning and masking a Si wafer to define the tub 
regions followed by etching of deep-slanted wall troughs. A thick Si0 2 film is 
grown and seed windows are opened down to the substrate by etching away the 
Si0 2 . Then a thick polycrystalline Si layer (—100 fi m thick) is deposited by 
CVD methods. This surface layer is melted by the unidirectional radiant heat 
flux from incoherent light emitted by tungsten halogen arc lamps (lamp 
furnace). The underlying wafer protected by the thermally insulating Si0 2 film 
does not melt except in the seed windows. Crystalline Si nucleates at each 
seed, grows vertically, and then laterally across the Si0 2 , leaving a single¬ 
crystal layer in its wake upon solidification. Lastly, mechanical grinding and 
lapping of the solidified layer prepares the structure for further microdevice 
processing. Conventional dielectric isolation processing also employs a thick 
CVD Si layer. But the latter merely serves as the mechanical handle enabling 
the bulk of the Si wafer to be ground away. 
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Figure 7-18. Schematics of methods employed to isolate single-crystal Si tubs, (left) 
conventional dielectric isolation process; (right) LEGO process. (Courtesy of G. K. 
Celler, AT&T Bell Laboratories.) 


An alternative process for broad-area lateral epitaxial growth over Si0 2 
employs a strip heat source in the form of a hot graphite or tungsten wire, 
scanned laser, or electron beam. After patterning the exposed polycrystalline 
or amorphous Si above the surrounding oxide, the strip sweeps laterally across 
the wafer surface. Local zones of the surface then successively melt and 
recrystallize to yield, under ideal conditions, one large epitaxial Si film layer. 
Analogous processes involving seeded lateral growth and selective deposition 
from the vapor phase also show much promise. 


7.5.3. Vapor Phase Epitaxy (VPE) 

An account of the most widely used VPE methods—chloride, hydride, and 
organometallic CVD processes—has been given in Chapter 4. Here we briefly 
address a couple of novel VPE concepts that have emerged in recent years. 
The first is known as vapor levitation epitaxy (VLE), and the geometry is 
shown in Fig. 7-19. The heated substrate is levitated above a nitrogen track 
close to a porous frit through which the hot gaseous reactants pass. Upon 
impingement on the substrate, chemical reactions and film deposition occur 
while product gases escape into the effluent stream. As a function of radial 
distance from the center of the circular substrate, the gas velocity increases 
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Figure 7-19. (Top) Schematic of VLE process; (bottom) schematic of RTCVD 
process. (Courtesy of M. L. Green, AT&T Bell Laboratories.) 

while the gas concentration profile exhibits depletion. These effects cancel one 
another, and uniform films are deposited. The VLE process was designed for 
the growth of epitaxial HI-V semiconductor films and has certain advantages 
worth noting: 

1. There is no physical contact between substrate and reactor. 

2. Thin layer growth is possible. 

3. Sharp transitions can be produced between film layers of multilayer stacks. 

4. Commercial scale-up appears to be feasible. 
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The second method, known as rapid thermal CVD processing (RTCVD), is 
an elaboration on conventional VPE. Epitaxial deposition is influenced through 
rapid, controlled variations of substrate temperature. Source gases (e.g., 
halides, hydrides, metalorganics) react on low-thermal-mass substrates heated 
by the radiation from external high-intensity lamps (Fig. 7-19). The latter 
enable rapid temperature excursions, and heating rates of hundreds of degrees 
Celsius per second are possible. For m-V semiconductors, high-quality epitax¬ 
ial films have been deposited by first desorbing substrate impurities at elevated 
temperatures followed by immediate lower temperature growth (Ref. 18). 

Very high quality lattice-matched heteroepitaxial films can be grown by 
CVD methods. This is particularly true of OMVPE techniques where atomi¬ 
cally abrupt heterojunction interfaces have been demonstrated in alternating 
AlAs-GaAs (superlattice) structures. Only molecular-beam epitaxy, which is 
considered next, can match or exceed these capabilities. 

7.5.4. Molecular-Beam Epitaxy (Refs. 19-21) 

Molecular-beam epitaxy is conceptually a rather simple single-crystal film 
growth technique that, however, represents the state-of-the-art attainable in 
deposition processing from the vapor phase. It essentially involves highly 
controlled evaporation in an ultrahigh-vacuum (~ 10“ 10 torr) system. Interac¬ 
tion of one or more evaporated beams of atoms or molecules with the 
single-crystal substrate yields the desired epitaxial film. The clean environment 
coupled with the slow growth rate and independent control of the beam 
sources enable the precise fabrication of semiconductor heterostructures at an 
atomic level. Deposition of thin layers from a fraction of a micron thick down 
to a single monolayer is possible. In general, MBE growth rates are quite low, 
and for GaAs materials a value of 1 ^m/h is typical. 

A modem MBE system is displayed in the photograph of Fig. 7-20. 
Representing the ultimate in film deposition control, cleanliness and real-time 
structural and chemical characterization capability, such systems typically cost 
more than $1 million. The heart of a deposition facility is shown schematically 
in Fig. 7-21a. Arrayed around the substrate are semiconductor and dopant 
sources, which usually consist of so-called effusion cells or electron-beam 
guns. The latter are employed for the high-melting Si and Ge materials. On the 
other hand, effusion cells consisting of an isothermal cavity with a hole 
through which the evaporant exits are used for compound semiconductor 
elements and their dopants. Effusion cells behave like small-area sources and 
exhibit a cos <t> emission. Vapor pressures of important compound semiconduc¬ 
tor species are displayed in Fig. 3-2. 
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Figure 7-20. Photograph of multichamber MBE system. (Courtesy of Riber Divi¬ 
sion, Inc. Instruments SA). 


Consider now a substrate positioned a distance / from a source aperture of 
area A, with 0 = 0. An expression for the number of evaporant species 
striking the substrate is 

3.51 X \0 22 PA 

R = - 7-pz— molecules/cm 2 -sec. (7-13) 

Trl 2 (MT) 1 2 K ' 

As an example, consider a Ga source in a system where A = 5 cm 2 and 
/ = 12 cm. At T = 900 °C the vapor pressure P Ga * 1 x 10" 4 torr, and 
substituting Af Ga = 70, the arrival rate of Ga at the substrate is calculated to 
be 1.35 x 10 14 atoms/cm 2 -sec. The As arrival rate is usually much higher, 
and, therefore, film deposition is controlled by the Ga flux. An average 
monolayer of GaAs is 2.83 A thick and contains — 6.3 x 10 14 Ga atoms/cm 2 . 
Hence, the growth rate is calculated to be (1.35 x 10 14 )/(6.3 x 10 14 ) x 2.83 

o 

x 60 = 36 A/min, a rather low rate when compared with VPE. 

One of the recent advances in MBE technology incorporates a gas source to 
supply As and P, as shown in Fig. 7-21b. Organometallics used for this 
purpose are thermally cracked, releasing the group V element as a molecular 
beam into the system. Excellent epitaxial film quality has been obtained by this 
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Figure 7-21 . Arrangement of sources and substrate in (a) conventional MBE system, 
(b) MOMBE system. (Courtesy of M. B. Panish, AT&T Bell Laboratories.) 
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hybrid MBE-OMVPE process, which is known by the acronym MOMBE. 
Hydride gas sources (e.g., AsH 3 , PH 3 ) have also been similarly employed in 
MBE systems. 

In many applications, GaAs-Al^Gaj _ ^ As multilayers are required. For this 
purpose, the Ga and As beams are on continuously, but the A1 source is 
operated intermittently. The actual growth rates are determined by the mea¬ 
sured layer thickness divided by the deposition time. The fraction x can be 
determined from the relation 

^(Al^Ga^As) - R( GaAs) 

* = ^(Al*Ga, _*As) 

where the respective deposition rates R for GaAs and Al^Ga^^As must be 
known. Recommended substrate temperatures for MBE of GaAs range from 
500 to 630 °C. Higher temperatures, by about 50 °C, are required for 
Al^Ga^^As because AlAs is thermally more stable than GaAs. For InP 
growth from In and P 2 beams on (100) InP, substrate temperatures of 350-380 
°C have been used. Similarly, In^Ga^^As films, lattice-matched to InP, have 
been grown between 400 and 430 °C. 


7.6. Epitaxial Film Growth and Characterization (Ref. 22) 


(7-14) 


7.6.1. Film Growth Mechanisms 

Irrespective of whether homo- or heteroepitaxy is involved, it is essential to 
grow atomically smooth and abrupt epitaxial layers. This implies a layer 
growth mechanism, and thermodynamic approaches to layer growth based on 
surface energy arguments have been presented in Chapter 5. Ideally, the 
desired layer-by-layer growth depicted in Fig. 7-22 is achieved through lateral 
terrace, ledge, and kink extension by adatom attachment or detachment. In this 
case the new layer does not grow until the prior one is atomically complete. 
One can also imagine the simultaneous coupled growth of both the new and 
underlying layers. 

In this section we explore the interactions of molecular beams with the 
surface and the steps leading to the incorporation of atoms into the growing 
epitaxial film. Although MBE is the focus, the results are, of course, applica¬ 
ble to other epitaxial film growth sequences. The first step involves surface 
adsorption—the process in which impinging particles enter and interact within 
the transition region between the gas phase and substrate surface. Two kinds of 
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Figure 7-22. Real space representation of the formation of a single complete mono- 
layer; 6 is the fractional layer coverage; corresponding RHEED oscillation signal is 
shown. 


adsorption—namely, physical (physisorption) and chemical (chemisorption) — 
can be distinguished. If the particle (molecule) is stretched or bent but retains 
its identity, and van der Waals forces bond it to the surface, then we speak of 
physisorption. If, however, the particle loses its identity through ionic or 
covalent bonding with substrate atoms, chemisorption is involved. The two can 
be quantitatively distinguished on the basis of heats of adsorption— H p and 
H c , for physisorption and chemisorption, respectively. Typically, H p ~ 0.25 
eV and H c ~ 1-10 eV. 

Now consider a beam of Ga atoms incident on a GaAs surface. Below about 
480 °C, Ga atoms readily physisorb on the surface, but above this temperature 
Ga adsorbs as well as desorbs. Time-resolved mass spectroscopy measure¬ 
ments of the magnitude of the atomic flux desorbing from the substrate have 
revealed details of the mechanism of MBE GaAs film growth (Ref. 23). The 
instantaneous Ga surface concentration, n Ga , is increased by the incident Ga 
beam flux, /?(Ga), and simultaneously reduced by a first-order kinetics 
desorption process. Therefore, 


^Ga 


/?(Ga) 


W Ga 


dt 


r,(Ga) ’ 


(7-15) 
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where r 5 (Ga) is the Ga adatom lifetime and w Ca /r^(Ga) represents the Ga 
desorption flux /? des (Ga). Integrating Eq. 7-15 yields 

/? des (Ga) = *(Ga)[ 1 - exp - t/r s ( Ga)]. (7-16) 

For a rectangular pulse of incident Ga atoms, the detected desorption flux 
closely follows the dependence of Eq. 7-16. Similarly, when the Ga beam is 
abruptly shut off, the desorption rate decays as exp - t/r s (G a). The exponen¬ 
tial rise and decay of the signal is shown schematically in Fig. 7-23a. 

In the case of As 2 molecules incident on a GaAs surface, the lifetime is 
extremely short (t 5 (As 2 ) - 0), so the desorption pulse profile essentially 
mirrors that for deposition (Fig. 7-23b); i.e., /? des (As 2 ) = /?(As 2 ). However, 
on a Ga-covered GaAs surface, t 5 (As 2 ) becomes appreciable, with desorption 
increasing only as the available Ga is consumed (Fig. 23c). These observations 
indicate that in order to adsorb As 2 on GaAs at high temperature, Ga adatoms 
are essential. The detailed model for growth of GaAs requires physisorption of 
mobile As 2 (or As 4 ) precursors followed by dissociation and attachment to Ga 
atoms by chemisorption. Excess As merely re-evaporates, leading to the 
growth of stoichiometric GaAs. In summary, these adsorption-desorption 
effects strongly underscore the kinetic rather than thermodynamic nature and 
control of MBE growth. 

The III-V compound semiconductor films are generally grown with a 2- to 
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Figure 7-23. Deposition and desorption pulse shapes on (111) GaAs for (a) Ga, (b) 
As 2 , (c) As 2 on a Ga-covered surface. (From Ref. 23) 
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Figure 7-24. Atomic mechanisms involved in the sequential deposition of GalnAsP 
on InP (Reprinted with permission from John Wiley and Sons, from G. H. Olsen in 
GalnAsP Alloy Semiconductors , ed. by T. P. Pearsall, Copyright © 1982, John 
Wiley and Sons). 


10-fold excess of the group V element. This maintains the elemental V-III 
impingement flux ratio > 1. In the case of GaAs and Al^Ga, _ x As, this 
condition results in stable stoichiometric film growth for long deposition times. 
In contrast to this so-called As-stabilized growth, there is Ga-stabilized growth, 
which occurs when the flux ratio is approximately 1. An excess of Ga atoms is 
to be avoided, though, because it tends to cause clustering into molten 
droplets. The (100) and (111) surfaces of GaAs and related compounds exhibit 
a variety of reconstructed surface geometries dependent on growth conditions 
and subsequent treatments. For As- and Ga-stabilized growth, (2 X 4) and 
(4 X 2) reconstructions, respectively, have generally been observed on 
(100)GaAs. Other structures (i.e., C(2 X 8) As and C(8 X 2) Ga) have also 
been reported for the indicated stabilized structures. To complicate matters 
further, intermediate structures, e.g., (3 X 1), (4 x 6), (3 X 6), as well as 
mixtures also exist within narrow ranges of growth conditions. The complex 
issues surrounding the existence and behavior of these surface reconstructions 
are being actively researched. 

During epitaxial film deposition of multicomponent semiconductors, the 
mechanisms of substrate chemical reactions and atomic incorporation can be 
quite complex. For example, a proposed model for sequential deposition of the 
first two monolayers of GalnAsP on an InP substrate is depicted in Fig. 7-24 
for the hydride process (Ref. 24). The first step is suggested to involve 
adsorption of P and As atoms. Then GaCl and InCl gas molecules also adsorb 
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in such a way that the Cl atoms dangle outward from the surface. Next, 
gaseous atomic hydrogen adsorbs and reacts with the Cl atoms to form HC1 
molecules, which then desorb. Now the process repeats with P and As 
adsorption, and when the cycle is completed another bilayer of quaternary 
alloy film deposits. This picture accounts for single-crystal film growth and the 
development of variable As-P and Ga-In stoichiometries. 


7.6.2. In Situ Film Characterization 

This section deals with techniques that are capable of monitoring the structure 
and composition of epitaxial films during in situ growth. Both LEED and 
RHEED have this ability. They are distinguished in Fig. 7-25. An ultrahigh- 
vacuum environment is a necessity for both methods because of the sensitivity 
of diffraction to adsorbed impurities and the need to eliminate electron-beam 
scattering by gas molecules. In LEED a low-energy electron beam (— 10-1000 
eV) impinges normally on the film surface and only penetrates a few angstroms 
below the surface. Bragg’s law for both lattice periodicities in the surface plane 
results in cones of diffracted electrons emanating along forward and backseat - 
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Figure 7-25. Experimental arrangements of LEED and RHEED techniques. 
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tered directions. Simultaneous satisfaction of the diffraction conditions means 
that constructive interference occurs where the cones intersect along a set of 
lines or beams radiating from the surface. These backscattered beams are 
intercepted by a set of grids raised to different electric potentials. The first 
grids encountered retard the low-energy inelastic electrons from penetrating. 
The desired diffracted (elastic) electrons of higher energy pass through and, 
accelerated by later grids, produce illuminated spots on the fluorescent screen. 

In RHEED the electron beam is incident on the film surface at a grazing 
angle of a few degrees at most. Electron energies are much higher than for 
LEED and range from 5 to 100 keV. An immediate advantage of RHEED is 
that the measurement apparatus does not physically interfere with deposition 
sources in an MBE system the way LEED does. This is one reason why 
RHEED has become the preferred real-time film characterization accessory in 
MBE systems. 

Both LEED and RHEED patterns of the (7 X 7) structure of the Si(lll) 
surface are shown in Fig. 7-26. To obtain some feel for the nature of these 
diffraction patterns, we think in terms of reciprocal space. Arrays of reciprocal 
lattice points form rods or columns of reciprocal lattice planes shown as 
vertical lines pointing normal to the real surface. They are indexed as (10), 
(20), etc., in Fig. 7-27. Consider now an electron wave of magnitude 2ir /X 
propagating in the direction of the incident radiation and terminating at the 
origin of the reciprocal lattice. Following Ewald, we draw of sphere of radius 
2i r/X about the center. A property of this construction is that the only possible 
directions of the diffracted rays are those that intersect the reflecting sphere at 
reciprocal lattice points as shown. To prove this, we note that the normal to the 
reflecting plane is the vector connecting the ends of the incident and diffracted 
rays. But this vector is also a reciprocal lattice vector. Its magnitude is 2w/a 
(Eq. 7-1), where a is the interplanar spacing for the diffracting plane in 
question. It is obvious from the geometry that 

2 7T 2 7T 

— = 2 x — sin0, 7-17) 

a X 

which reduces to Bragg's law, the requisite condition for diffraction. When the 
electron energies are small as in LEED, the wavelength is relatively large, 
yielding a small Ewald sphere. A sharp spot diffraction pattern is the result. 
The intense hexagonal spot array of Fig. 7-26a reflects the sixfold symmetry of 
the (111) plane, and the six fainter spots in between are the result of the 
(7 x 7) surface reconstruction. 

In RHEED, on the other hand, the high electron energies lead to a very 
large Ewald sphere (Fig. 7-27). The reciprocal lattice rods have finite width 
due to lattice imperfections and thermal vibrations; likewise, the Ewald sphere 
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Figure 7-26. (a) LEED pattern of Si surface. (38-eV electron energy, normal 
incidence) (b) RHEED pattern of Si surface. (5-keV electron energy, along (112) 
azimuth) (Courtesy H. Gossmann, AT&T Bell Laboratories.) 
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Figure 7-27. Ewald sphere construction for LEED and RHEED methods. The film 
plane is horizontal, and reciprocal planes are vertical lines. 


is of finite width because of the incident electron energy spread. Therefore, the 
intersection of the Ewald sphere and rods occurs for some distance along their 
height, resulting in a streaked rather than spotty diffraction pattern. During 
MBE film growth both spotted and streaked patterns can be observed; spots 
occur as a result of three-dimensional volume diffraction at islands or surface 
asperities, whereas streaks characterize smooth layered film growth. These 
features can be seen in the RHEED patterns obtained from MBE-grown GaAs 
films (Fig. 7-28). 

An important attribute of the RHEED technique is that the diffracted beam 
intensity is relatively immune to thermal attenuation arising from lattice 
vibrations. This makes it possible to observe the so-called RHEED oscillations 
during MBE growth at elevated temperatures. The intensity of the specular 
RHEED beam undergoes variations that track the step density on the growing 
surface layer. If we reconsider Fig. 7-22 and associate the maximum beam 
intensity with the flat surface where the fractional coverage 0 = 0 (or 6 = 1), 
then the minimum intensity corresponds to 0 = 0.5. During deposition of a 
complete monolayer, the beam intensity, initially at the crest, falls to a trough 
and then crests again. Film growth is, therefore, characterized by an attenu¬ 
ated, sinelike wave with a period equal to the monolayer formation time. 
Under optimal conditions the oscillations persist for many layers and serve to 
conveniently monitor film growth with atomic resolution. 

The temperature above which RHEED oscillations are expected can be 
easily estimated. The required diffusivity to allow a few atomic jumps to occur 
and smooth terraces before they are covered by a monolayer (per second) is 
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Figure 7-28. RHEED patterns (40 keV, (110) azimuth) and corresponding electron 
micrograph replicas (38,400 x ) of same GaAs surface: ^a) polished and etched GaAs 
substrate heated in vacuum to 580 °C for 5 min; (b) 150-A film of GaAs deposited; (c) 
1 fim of GaAs deposited. (Ref. 23), (Courtesy of A. Y. Cho, AT&T Bell Laboratories). 


roughly 10” 15 cm 2 /sec. By the example in Section 5.3.1, RHEED oscillations 
are predicted to occur above 0.27^, 0.127^, and 0.03 T M on group IV 
elements, metal, and alkali halide substrates, respectively, in reasonable agree¬ 
ment with experiment. 


7.6.3. X-ray Diffraction Analysis of Epitaxial Films (Refs. 22, 25) 

Let us suppose we wish to nondestructive^ measure the composition of a 
ternary epitaxial film of Al x Gaj .^.As on GaAs to an accuracy of 2%. One way 
to do this is to use the connection between the lattice parameter a 0 and x. 
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Vegard’s law then suggests that a 0 must be measured to a precision of 

Aa 0 2 a 0 (AlAs) - a 0 (GaAs) 

—- =--- - f --- - 2.8 x 10" 5 

a 0 100 a 0 (GaAs) 


or 1 part in over 35,000. This is quite a formidable challenge, and neither 
LEED nor RHEED can even remotely approach such a capability. X-ray 
diffraction methods can however, but not easily. By Bragg’s law (Eq. 7-15), 
differentiation yields 

A a AX A 0 

— --. (7-18) 

a X tan 0 


This equation reveals the inadequacy of conventional X-ray diffraction 
methods in meeting the required measurement precision. For example, typical 

o 

CuKa (X = 1.5406 A) radiation from an X-ray tube exhibits a so-called 

o 

spectral dispersion of 0.00046 A, so AX/X = 0.0003. This causes unaccept¬ 
able diffraction peak broadening. In addition, the angular divergence of the 
beam must be several seconds of arc, and it is not possible to achieve this with 
usual slit-type collimation. 



9 (Degrees) 

Figure 7-29. Rocking curve for (004) reflection of ZnSe on GaAs. (Courtesy of B. 
Greenberg, Philips Laboratories, North American Philips Corp.) Inset: Schematic of 
high-resolution double-crystal diffractometer. (From Ref. 25). 
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For these reasons, the high-resolution double-crystal diffractometer, shown 
schematically in Fig. 7-29, is indispensible. It has three special features: 

1. Very high angular stepping accuracy on the 0 axis (i.e., ~ 1 arc second) 

2. Very good angular collimation of the incident X-ray beam (i.e., < 10 arc 
seconds) 

3. Elimination of peak broadening due to spectral dispersion 

The diffractometer consists of a point-focus X-ray source of monochromatic 
radiation that falls on a first collimator crystal composed of the same material 
as the sample epitaxial film (second crystal). When the Bragg condition is 
satisfied, both crystals are precisely parallel. The Bragg condition is simultane¬ 
ously satisfied for all source wavelength components—i.e., no wavelength 
dispersion. During measurement, the sample is rotated or rocked through a 
very small angular range, bringing planes in and out of the Bragg condition. 
The resulting rocking curve diffraction pattern contains the very intense 
substrate peak that serves as the internal standard against which the position of 
the low-intensity epitaxial film peak is measured. 

The following example (Ref. 26) involving ZnSe, a potential blue laser 
material, illustrates the power and importance of the technique. A rocking 

o 

curve of an 1100-A film of ZnSe grown epitaxially on a (001) GaAs substrate 
is shown in Fig. 7-29 for the (004) reflection. In GaAs, a 0 = 5.6537 A and 
a ( 0 O 4 ) - 1-4134 A. Since X = 1.5406 A, Bragg’s law yields 6 = 33.025°. For 
ZnSe, a 0 = 5.6690 A (tf (004) = 1.4173), and the expected Bragg angle for 
unstrained ZnSe is 32.923°. But the actual (004) peak appears at 32.802°, 

o 

which corresponds to a (004) = 1.4219 A. To interpret these findings, note that 
the misfit (Eq. 7-3) in this system is -0.27%, and, hence, ZnSe is biaxially 
compressed in the film plane. Since the film thickness is less than d c (Eq. 7-7), 
it has grown pseudomorphically or coherently with GaAs; we can therefore 
assume that ZnSe has the same lattice constant as GaAs in the interfacial plane. 
However, normal to this plane the ZnSe lattice expands and assumes a 
tetragonal distortion. The measured increase in the (004) interplanar spacing of 
ZnSe is thus consistent with this explanation. 

Before leaving the subject of X-ray diffraction, we briefly comment on its 
application in the characterization of epitaxial superlattices. These structures, 
discussed further in Section 14.7, contain a synthesized periodicity, associated 
with numerous alternating layers, superimposed on the crystalline periodicity 
of each individual layer. Resulting diffraction patterns consist of the intense 
substrate peak flanked on either side by a satellite structure related to the 
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Figure 7-30. High-resolution X-ray rocking curve of a 10-period Ga 047 In 0 53 As-InP 

o o 

superlattice with d s = 540 A and 79-A-thick Ga 047 In 0 53 As layers (data taken with 
four-crystal diffractometer): (a) actual data; (b) simulation assuming no interfacial 
strain; (c) simulation assuming strained layers. (From Ref. 27). 



superlattice period d s . An extension of Bragg’s law gives 


d s 


K - »>)* 

2(sin 0, - sin 0 y ) 


(7-19) 


where d s is the thickness of a neighboring pair of film layers, and n { and n, 
are the diffraction orders. As an illustration, the (004) rocking curve for the 
10-period Ga 0 47 In 0 53 As-InP superlattice is shown in Fig. 7-30a. Interest¬ 
ingly, X-ray rocking curves can be computer-modeled to simulate composi¬ 
tional and dimensional information on superlattices containing abrupt interfaces 
with remarkable precision and sensitivity. Thus, curve c, which closely fits the 
data, models the case where opposite interfaces of each Ga 0 47 In 0 53 As layer 
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are strained differently. Curve b , on the other hand, a poorer fit, models the 
case where interfacial strain is omitted. High-resolution X-ray diffraction 
methods reveal the excellent microscopic detail with which epitaxial films can 
be investigated. 


7.7. Conclusion 


Even after coverage in this as well as Chapter 5, additional references to 
epitaxial films are scattered in various contexts throughout the remainder of the 
book. The most extensive treatment, located in Chapter 14, is devoted largely 
to superlattice structures and emerging electronic devices based on them. 



Exercises 



1- During a drought, there is frequently enough moisture in the atmosphere 
to produce clouds but rain does not fall. Comment on the practice of 
seeding clouds with crystals of Agl to induce ice nucleation and rain 
formation. [Note: The crystal structure of ice is hexagonal with lattice 
constants of a = 4.52 A, c = 7.36 A; the crystal structure of Agl is also 

o o 

hexagonal with a = 4.58 A and c = 7.49 A.] 

2. Fe thin films grown on single-crystal A1 substrates were found to be 
essentially dislocation-free to a thickness of 1400 A, whereas misfit 
dislocations appeared with thicker films. If the lattice parameters of Fe 
and A1 are 2.867 A and 4.050 A, respectively, what are the probable 
indices describing the epitaxial interface crystallography? 

- o 

3. A monatomic FCC material has a lattice parameter of 4 A. For the (110) 
and (111) surfaces, 

a. sketch the direct crystallographic net indicating the primitive unit cell. 

b. draw the reciprocal net. 

c. compare the patterns you drew with the ball model structures and laser 
diffraction patterns of Fig. 5-22. 

d. calculate the spacing between rows (hk). 

e. LEED patterns are generated at normal incidence for electron energies 
of 50 and 200 eV. The crystal surface-screen distance is 200 mm. 
Index the resulting diffraction patterns that would be seen on the 180° 
sector screen. 
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4. a. Calculate the lattice misfit between GaAs and Si. 

b. What is the critical thickness for pseudomorphic growth of GaAs films 
on Si? Is this thickness sufficient for fabrication of devices? 

c. Even though GaP films are more closely lattice-matched to Si, what 
difficulties do you foresee in the high-temperature epitaxial growth of 
this material on Si substrates? 

5. You are asked to suggest II-VI and III-V compounds as heteroepitaxial 
combinations for potential semiconductor device applications. Mention 
two such systems that appear promising and indicate the misfit for each. 

6 . Suppose monolayer formation depicted in Fig. 7-22 corresponds to (2 x 1) 
growth. Sketch the next monolayer if growth leads to the (1 x 2) 
orientation. 

7. After 10 min at 800 °C, a 3-^m-thick layer of GaAs was observed to 
form for both equilibrium and step-cooling LPE growth mechanisms. 

a. How thick were the respective GaAs films after 5 min? 

b. At what time will the growth rates for equilibrium and step cooling be 
identical? 

8 - If the temperature regulation in effusion cells employed in MBE is ±2 
°C, what is the percent variation in the flux of atoms arriving at the 
substrate for the deposition of GaAs films (Ga evaporated at 1200 K, As 2 
evaporated at 510 K.) 

9. Sequential layers of GaAs and AlGaAs films were grown by MBE. The 
GaAs beams were on throughout the deposition, which lasted 1.5 h. The 
A1 beam was alternately on for 0.5 min and off for 1 min during the entire 
run. Film thickness measurements showed that 1.80 fim of GaAs and 
0.35 /im of AlAs were deposited. 

a. What are the growth rates of GaAs and A\ X G } _ v As? 

b. What is x , the atom fraction of A1 in Al^Ga, - a .As? 

c. What are the thicknesses of the GaAs and A1 v Ga, _ ^ As layers? 

d. How many layers of each film were deposited? (From A. Gossard, 
AT&T Bell Laboratories.) 

10. It is desired to make diode lasers that emit coherent radiation with a 
wavelength of 1.24 /Am. For this purpose, III-V compounds or ternary 
sol id-solution alloys derived from them can be utilized. At least four 
possible compound combinations (alloys) will meet the indicated specifi¬ 
cations. For each alloy specify the original pair of binary compounds, the 
composition, and the lattice constant. (Assume linear mixing laws.) 
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11 . The quaternary Ga^In, ^As^P, alloy semiconductors have an energy 
gap and lattice parameter given, respectively, by £ (eV) = 1.35 — 
0.12 y + 0A2y 1 2 3 and a 0 (x, - y)(A) = 0.1894^ - 0.4184a: + 0.0130 xy 
+ 5.869. If it is desired to produce light-wave devices operating at 1.32 
and 1.55 ^m, calculate the values for x and j\ assuming perfect lattice 
matching to InP. 

12. If the residence time of Ga adatoms on GaAs is 10 sec at 895 °C and 7 
sec at 904 °C, what is the expected residence time at 850 °C? 

13. An engineer attached a new cylinder labeled 10% AsH 3 in H 2 to a CVD 
reactor growing epitaxial GaAs films. Instead of the usual dark gray film, 
an orange deposit formed on the reactor walls. What went wrong? (From 
R. Dupuis, AT&T Bell Laboratories.) 

14. One strategy for producing GaAs on Si first involves the deposition of a 
lattice-matched GaP layer. Next, GaAs 0 5 P 0 5 is deposited and forms a 
strained epitaxial layer between GaP and the topmost GaAs film. 

a. Indicate the variation in lattice parameter through the structure from Si 
to GaAs. 

b. What is the nature of the lattice deformation at the GaP-GaAs 0 5 P 0 5 
and GaAs 0 5 P 0 5 -GaAs interfaces? 

c. Very roughly estimate the required thicknesses of both GaP and 
GaAs 0 5 P 0 5 for pseudomorphic growth. 

15. X-ray rocking curves for an epitaxial GalnAsP film lattice-matched to InP 
were recorded for the (002), (006), and (117) reflections using CuKa 
radiation. 

a. What are the Bragg angles for these reflections in InP? 

b. Suppose the lattice mismatch between the epilayer and substrate is 
1.3 x 10 4 . What Bragg angles correspond to the InGaAsP peaks. 

c. For which reflection is the peak separation greatest? 
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Interdiffusion and 
Reactions in Thin Films 


8.1. Introduction 


There is hardly an area related to thin-film formation, properties, and perfor¬ 
mance that is uninfluenced by mass-transport phenomena. This is especially 
true of microelectronic applications, where very small lateral as well as depth 
dimensions of device features and film structures are involved. When these 
characteristic dimensions ( d ) become comparable in magnitude to atomic 
diffusion lengths, then compositional changes can be expected. New phases 
such as precipitates or layered compounds may form from ensuing reactions, 
altering the initial film integrity. This, in turn, frequently leads to instabilities 
in the functioning of components and devices that are manifested by such 
effects as decrease in conductivity as well as short- or even open-circuiting of 
conductors, lack of adhesion, and generation of stress. The time it takes for 
such effects to evolve can be roughly gauged by noting that the diffusion length 
is given by — 2 '/Dt , where D and t are the appropriate diffusivity and time, 
respectively. Therefore t ~ d 2 /AD. As we shall see, D values in films are 
relatively high even at low temperatures, so small film dimensions serve to 
make these characteristic times uncomfortably short. Such problems frequently 
surface when neighboring combinations of materials are chemically reactive. 
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For example, consider the the pitfalls involved in designing a Cu-Ni film 
couple as part of the contact structure for solar cells (Ref 1). Readily available 
high-temperature data in bulk metals extrapolated to 300 °C yield a value of 
3.8 x 10 -24 cm 2 /sec for the diffusion coefficient of Cu in Ni. For a 1000A 
thick Ni film, the interdiffusion time is thus predicted to be (10 _5 ) 2 /4(3.8 x 
10 ~ 24 ) sec, or over 200,000 years! Experiment, however, revealed that these 
metals intermixed in less than an hour. When colored metal films are involved, 
as they are here, the eye can frequently detect the evidence of interdiffusion 
through color or reflectivity changes. The high density of defects, e.g., grain 
boundaries and vacancies, causes deposited films to behave differently from 
bulk metals, and it is a purpose of this chapter to quantitatively define the 
distinctions. Indeed, a far more realistic estimate of the Cu-Ni reaction time 
can be made by utilizing the simple concepts developed in Section 8.2. Other 
examples will be cited involving interdiffusion effects between and among 
various metal film layer combinations employed in Si chip packaging applica¬ 
tions. Practical problems associated with making both stable contacts to 
semiconductor surfaces and reliable interconnections between devices have 
been responsible for generating the bulk of the mass-transport-related concerns 
and studies in thin films. For this reason, issues related to these extremely 
important subjects will be discussed at length. 

While interdiffusion phenomena are driven by chemical concentration gradi¬ 
ents, other mass-transport effects take place even in homogeneous films. These 
rely on other driving forces such as electric fields, thermal gradients, and 
stress fields, which give rise to respective electromigration, thermomigration, 
and creep effects that can similarly threaten film integrity. The Nemst-Ein¬ 
stein equation provides an estimate of the characteristic times required for such 
transport effects to occur. Consider a narrow film stripe that is as wide as it is 
thick. If it can be assumed that the volume of film affected is — d 3 and the 
mass flows through a cross-sectional area rf 2 , then the appropriate velocity is 
d /1. By utilizing Eq. 1-35, we conclude that t « RTd/DF. Large driving 
forces (F), which sometimes exist in films, can conspire with both small d and 
high D values to reduce the time to an undesirably short period. As circuit 
dimensions continue to shrink in the drive toward higher packing densities and 
faster operating speeds, diffusion lengths will decrease and the surface-area- 
to-volume ratio will increase. Despite these tendencies, processing tempera¬ 
tures and heat generated during operation are not being proportionately re¬ 
duced. Therefore, interdiffiision problems are projected to persist and even 
worsen in the future. 

In addition to what may be termed reliability concerns, there are beneficial 
mass-transport effects that are relied on during processing heat treatments in 
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films. Aspects of both of these broad applications will be discussed in this 
chapter in a fundamental way within the context of the following subjects: 

8.2. Fundamentals of Diffusion 

8.3. Interdiffusion in Metal Alloy Films 

8.4. Electromigration in Thin Films 

8.5. Metal-Semiconductor Reactions 

8.6. Silicides and Diffusion Barriers 

8.7. Diffusion During Film Growth 

Before proceeding, the reader may find the survey of diffusion phenomena 
given in Chapter 1 useful and wish to review it. 


8.2. Fundamentals of Diffusion 


8.2.1. Comparative Diffusion Mechanisms 

Diffusion mechanisms attempt to describe the details of atomic migration 
associated with mass transport through materials. The resulting atom move¬ 
ments reflect the marginal properties of materials in that only a very small 
fraction of the total number of lattice sites, namely, those that are unoccupied, 
interstitial, or on surfaces, is involved. An illustration of the vacancy mecha¬ 
nism for diffusion was given on p. 36. Similarly, the lattice diffusivity D L , in 
terms of previously defined quantities, can be written as 

D l = £> 0 exp - E l /RT, (8-1) 

where E L is the energy for atomic diffusion through the lattice on a per-mole 
basis. In poly crystal line thin films the very fine grain size means that a larger 
proportion of atom-defect combinations is associated with grain boundaries, 
dislocations, surfaces, and interfaces, relative to lattice sites, than is the case in 
bulk solids. Less tightly bound atoms at these nonlattice sites are expected to 
attract different point-defect populations and be more mobile than lattice 
atoms. Although the detailed environment may be complex and even varied, 
the time-averaged atomic transport is characterized by the same type of 
Boltzmann behavior expressed by Eq. 8-1. Most importantly, the activation 
energies for grain-boundary, dislocation, and surface diffusion are expected to 
be smaller than E L , leading to higher diffusivities. Therefore, such hetero¬ 
geneities and defects serve as diffusion paths that short-circuit the lattice. 

In order to appreciate the consequences of allowing a number of uncoupled 
transport mechanisms to freely compete, we consider the highly idealized 
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Figure 8-1. Highly idealized polycrystalline film containing square grains, grain 
boundaries, and dislocations. 

polycrystalline film matrix in Fig. 8-1. Grain-boundary slabs of width 5 serve 
as short-circuit diffusion paths even though they may only be 5-10 A wide. 
They separate square-shaped grains of side /. Within the grains are dissociated 
dislocations oriented normal to the film surface. They thread the latter with a 
density p d per cm 2 , and diffusion is assumed to occur through the dislocation 
core whose cross-sectional area is A d . Parallel transport processes normal to 
the film plane are assumed to occur for each mechanism. Under these 
conditions, the number of atoms («,) that flow per unit time is essentially equal 
to the product of the appropriate diffusivity (£>,), concentration gradient 


( dc/dx) iy and transport 

area 

involved. 

Therefore, 
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where L, b and d refer to lattice, grain-boundary, and dislocation quantities. 

The importance of short-circuit mass flow relative to lattice diffusion can be 
quantitatively understood in the case of face-centered cubic metals where data 
for the individual mechanisms are available. A convenient summary of result¬ 
ing diffusion parameters is given by (Ref. 2) 

D l ~ 0.5 exp - \1.QT m /T cm 2 /sec, (8-3a) 

5D b ~ 1.5 X 10 8 exp - S.9T m /T cm 3 /sec, (8-3b) 

A d D d ~ 5.3 x 10 _15 exp - \2.5T M /T cm 4 /sec. (8-3c) 


Lattice: 

Grain 
Boundary: 

Dislocation: 
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These approximate expressions represent average data for a variety of FCC 
metals normalized to the reduced temperature T / T M , where Tm is the melting 
point. As an example, the activation energy for lattice self-diffusion in Au is 
easily estimated through comparison of Eqs. 8-1 and 8-3a, which gives 
E l /RT= [1.0T m /T. Therefore, E L = ll.0RT M or (17.0)(1.99 cal/mole- 
K)( 1336 K) = 45,200 cal/mole. As a first approximation, the preceding 
equations can be assumed to be valid for both self- and dilute impurity 
diffusion. Generalized Arrhenius plots for D L as a function of Tm i T have 
already been introduced in Fig. 5-6 for metal, semiconductor, and alkali halide 
matrices. 

Regimes of dominant diffusion behavior, normalized to the same concentra¬ 
tion gradient, can be mapped as a function of / and p d by equating the various 
h i in Eq. 8-2. The equations of the boundary lines separating the operative 
transport mechanisms are thus 


1 

7 


Dl 

5D b ’ 


(>d = 


Dl 

AdDd 


and — 
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AjjDdPd_ 

*D b 


These are plotted as In 1 / / versus p d in Fig. 8-2 at four levels of T/ T m , 
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Figure 8-2. Regimes of dominant diffusion mechanisms in FCC metal films as a 
function of temperature. (Reprinted with permission from Elsevier Sequoia, S.A. from 
R. W. Balluffi and J. M. Blakely, Thin Solid Films 25, 363, 1975). 
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employing Eqs. 8-3a, b, c. The broken rectangles represent the range of thin 
film values for / and p d that occur in practice. For typical metal films with a 
grain size of 1 or less, grain-boundary diffusion dominates at all practical 
temperatures. Similarly, for dislocation-free epitaxial films where 1// = 0, 
lattice diffusion dominates. Transport at these extremes is intuitively obvious. 
Where the film structure is such that combinations of mechanisms are opera¬ 
tive, different admixtures will occur as a function of temperature. Generally, 
lower temperatures will favor grain boundary and dislocation short-circuiting 
relative to lattice diffusion. 

Surface diffusion is another transport mechanism of relevance to thin films 
because of the large ratio of the number of surface-to-bulk atoms. As noted in 
Chapter 5, this mechanism plays an important role in film nucleation and 


T (°C) T(°C) 

1300 1200 1100 1000 900 1200 1100 1000 900 



Figure 8-3. Diffusion coefficients of various elements in Si and GaAs as a function 
of temperature. (Reprinted with permission from John Wiley and Sons, from S. M. 
Sze, Semiconductor Devices: Physics and Technology , Copyright © 1985, John 
Wiley and Sons). 
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growth processes. Reduced parameters describing measured surface transport 
in FCC metals have been suggested (Ref. 3); e.g., 

6.54 T m „ T m 

D s ~ 0.014exp-—— cm 2 /sec for > 1.3. 

It is well known, however, that surface diffusion varies strongly with ambient 
conditions, surface crystallography, and the nature and composition of surface 
and substrate atoms. 

Systematics similar to those depicted in Fig. 8-2 also govern diffusion 
behavior in ionic solids and semiconductors where grain boundaries and 
dislocations are known to act as short-circuit paths. However, complex space- 
charge effects in ionic solids make a clear separation of lattice and grain¬ 
boundary diffusion difficult in these materials. In semiconductors a great deal 
of impurity diffusion data exists, and these are used in designing and analyzing 
doping treatments for devices. This is a specialized field, and complex 
modeling (Ref. 4) is required to accurately describe diffusion profiles. Due to 
the importance of Si and GaAs films, preferred lattice dopant diffusion data are 
presented in Fig. 8-3 (Ref. 5). Some very recent data on diffusion of noble 
metals Au, Ag, and Cu in amorphous Si films interestingly reveals that the 
activation energy for diffusion in the disordered matrix is very similar to 
values obtained for lattice diffusion in crystalline Si. (Ref. 6). 


8.2.2. Grain-Boundary Diffusion 

Of all the mass-transport mechanisms in films, grain-boundary (GB) diffusion 
has probably received the greatest attention. This is a consequence of the rather 
small grain size and high density of boundaries in deposited films. Rapid 
diffusion within individual GBs coupled with their great profusion make them 
the pathways through which the major amount of mass is transported. Low 
diffusional activation energies foster low-temperature transport, creating seri¬ 
ous reliability problems whose origins can frequently be traced to GB involve¬ 
ment. This has motivated the modeling of both GB diffusion and phenomena 
related to film degradation processes. 

The first treatment of GB diffusion appeared nearly 40 years ago. The 
Fisher model (Ref. 7) of GB diffusion considers transport within a semi-in¬ 
finite bicrystal film initially free of diffusant, as shown in Fig. 8-4. A diffusant 
whose concentration C 0 is permanently maintained at plane y = 0 diffuses 
into the GB and the two adjoining grains. At low temperatures in typical 
poly crystalline films, it is easily shown that there is far more transport down 
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C 
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Figure 8-4. Representation of diffusional penetration down a grain boundary (y 
direction) with simultaneous lateral diffusion into adjoining grains (x direction). 


the GB than there is into the matrix of the grains. The ratio of these two fluxes 
can be estimated through the use of Eqs. 8-2 and 8-3 for FCC metals; i.e., 

fth &D h 3 x 10“ 8 8.1 T m 


Assuming l = 10 -4 cm and T / T M = 1/3, we have ft b /ft } - 1.1 X 10 7 . For 
this reason, we may envision transport to consist primarily of a deep rapid 
penetration down the GB from which diffusant subsequently diffuses laterally 
into the adjoining grains, building up the concentration level there. This is 
shown schematically in Fig. 8-4 and described mathematically by 


C L (x, y, t) = C 0 exp - 


WDl_ 

5D b 'firt 


y • erfc 


2 


where C L {x y y , /) is the diffusant concentration at any position and time. 

The Fisher analysis of the complex, coupled GB-lattice diffusion process 
yields simplified decoupled solutions—an exponential diffusant profile in the 
GB and an error function profile within adjoining grains. Experimental verifi¬ 
cation of Eq. 8-4 is accomplished by measurement of the integrated concentra¬ 
tion C within incremental slices Ay thick (e.g., by sputtering) normal to the 
y = 0 surface; i.e., 

C= f C L (x y y t t) dx Ay = const e~ (2 ^^ /8Db ^ )l/2y . (8-5) 

J — oo 






8.2. Fundamentals of Diffusion 


363 


The last equation suggests that a plot of In C versus y is linear. Therefore, the 
useful result 


bD b = 


1.0 

\fit 




( 8 - 6 ) 


emerges. However, in order to obtain dD bJ the value of D t in the same 
system must be independently known. This poses no problem usually, since 
lattice diffusivity data are relatively plentiful. Exact, but far more complicated, 
integral solutions, that are free of the simplifications of the Fisher analysis, 
have been obtained by Whipple (Ref. 8) and Suzuoka (Ref. 9). A conclusion, 
based on these analyses, that has been extensively used is 


5 D b = -0.66 


dlnC\ 5/3 14D. 


1/2 


(8-7) 


Apart from overriding questions of correctness, the difference between Eqs. 
8-6 and 8-7 is that In C is plotted versus y in the former and versus y 6/5 in 
the latter. Frequently, however, the experimental concentration profiles are not 
sufficiently precise to distinguish between these two spatial dependencies. It 
does not matter that actual films are not composed of bicrystals, but rather 
polycrystals with GBs of varying type and orientation; the general character of 
the solutions is preserved despite the geometric complexity. A schematic 
representation of equiconcentration profiles in a polycrystalline film containing 
an array of parallel GBs is shown in Fig. 8-5. At elevated temperatures the 
extensive amount of lattice diffusion masks the penetration through GBs. At 
the lowest temperatures, virtually all of the diffusant is partitioned to GBs. In 
between, the admixture of diffusion mechanisms results in an initial rapid 
penetration down the short-circuit network, which slows down as atoms leak 
into the lattice. The behaviors indicated in Fig. 8-5 represent the so-called A-, 
B-, and C-type kinetics (Ref. 10). Polycrystalline film diffusion phenomena 
have been studied in the B to C range for the most part. Excellent reviews of 



A KINETICS B KINETICS C KINETICS 


Figure 8-5. Schematic representation of type A (highest-temperature), B, and C 
(lowest-temperature) diffusion kinetics. (From Ref. 10). 
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the mathematical theories of GB diffusion including discussions of transport in 
these different temperature regimes, and applications to thin-film data are 
available (Refs. 11, 12). The best general source of this information is the 
volume Thin Films—Interdiffusion and Reactions , edited by Poate, Tu, and 
Mayer (Ref. 12) which also serves as an authoritative reference for much of 
the material discussed in this chapter. This book also contains a wealth of 
experimental mass transport data in thin-film systems. 

The experimental measurements of the penetration of radioactive i95 Au into 
epitaxial (Fig. 8-6a) and polycrystalline (Fig. 8-6b) Au films provide a test of 
the above theories. They also importantly illustrate how the spectrum of 
diffusion behavior can be decomposed into the individual component mecha¬ 
nisms through judicious choice of film temperature and grain size. These data 
were obtained by incrementally sputter-sectioning the film, collecting the 
removed material in each section, and then counting its activity level. Very low 



PENETRATION DISTANCE (I0‘ 6 cm) 

Figure 8-6a. Diffusional penetration profiles of I95 Au in (001) epitiaxial Au films at 
indicated temperatures and times. Lattice and dislocation diffusion dominate. (From 
Ref. 13). 
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Figure 8-6b. Diffusional penetration profiles of l,s Au in polycrystalline films at 
indicated temperatures and times. Only GB diffusion is evident. (Reprinted with 
permission from Elsevier Sequoia, S.A., From D. Gupta and K. W. Asai, Thin Solid 
Films 22, 121, 1974). 


concentration levels can be detected because radiation-counting equipment is 
quite sensitive and highly selective. This makes it possible to measure shallow 
profiles and detect penetration at very low temperatures. The epitaxial film 

o 

data display Gaussian-type lattice diffusion for the first 1000 to 1500 A, 
followed by a transition to apparent dislocation short-circuit transport beyond 
this depth. Rather than high-angle boundaries, these films contained a density 
of some 10 10 to 10" dissociated dislocations per cm 2 . On the other hand, 
extensive low-temperature GB penetration is evident in fig. 8-6b without much 
lattice diffusion. The large differences in diffusional penetration between these 
two sets of data, which are consistent with the systematics illustrated in Fig. 
8-2, should be noted. For epitaxial Au a mixture of lattice and dislocation 
diffusion is expected for p d ~ 10 IO /cm 2 at temperatures of - 0.4T W . Only 
GB diffusion is expected, however, at temperatures of - 0.37’ A/ for a grain 
size of 5 x 10' 5 cm, and this is precisely what was observed. 
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8.2.3. Diffusion in Miscible and Compound-Forming Systems 

8.2.3. f. Miscible Systems . It is helpful to initiate the discussion on diffu¬ 
sion in miscible systems by excluding the complicating effects of grain 
boundaries. Bulk materials contain large enough grains so that the influence of 
GBs is frequently minimal. For thin films a couple where both layers are single 
crystals (e.g., a heteroepitaxial system) must be imagined. Under such condi¬ 
tions, the well-established macroscopic diffusion analyses hold. Upon interdif¬ 
fusion in miscible systems, there is no crystallographic change, for this would 
imply new phases. Rather, each composition will be accessed at some point or 
depth within the film as a continuous range of solid solutions is formed. When 
the intrinsic atomic diffusivities are equal, i.e., D A = the profile is 
symmetric and Eq. l-27a governs the resultant diffusion. On the other hand, it 
is more common that D A =£ D h , so A and B atoms actually migrate with 
unequal velocities because they exchange with vacancies at different rates. 

As an example of a miscible system, consider the much-studied Au-Pd 
polycrystalline thin-film couple in Fig. 8-7 (Ref. 15). Both AES sputter 
sectioning and RBS methods were employed to obtain the indicated profiles, 
whose apparent symmetry probably reflects the lack of a strong diffusivity 
dependence on concentration. It is very tempting to analyze these data by 
fitting them to an error-function-type solution. Effective diffusivity values 
could be obtained, but they would tend to have limited applicability because of 
the heterogeneous character of the film matrix. It must not be forgotten that 


o 



DISTANCE FROM SURFACE (A) 

Figure 8-7. Palladium concentration profiles in a Au-Pd thin-film diffusion couple 
measured by RBS and AES techniques. (Reprinted with permission from Elsevier 
Sequoia, S.A., from P. M. Hall, J. M. Morabito and J. M. Poate, Thin Solid Films 
33, 107, 1976). 
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GB diffusion is the dominant mechanism in this couple. Therefore, the 
appropriate GB analysis is required in order to extract fundamental transport 
parameters. With this approach, it was found that a defect-enhanced admixture 
of GB and lattice diffusion was probably responsible for the large changes in 
the overall composition of the original films. Diffusional activation energies 
obtained for this film system are typically 0.4 times that for bulk diffusion, in 
accord with the systematics for GB diffusion. 


0.2.3.2. Compound-Forming Systems . Many of the interesting binary 
combinations employed in thin-film technology react to form compounds. 
Since the usual configuration is a planar composite structure composed of 
elemental films on a flat substrate, layered compound growth occurs. The 
concentration-position profile in such systems is schematically indicated in 
Fig. 8 - 8 . Each of the terminal phases is assumed to be in equilibrium with the 
intermediate compound. The compound shown is stable over a narrow rather 
than broad concentration range. Both types of compound stoichiometries are 
observed to form. With time, the compound thickens as it consumes the a 
phase at one interface and the 0 phase at the other interface. 

It is instructive to begin with the simplified analysis of the kinetics of 
compound growth based on Fig. 8 - 8 . Only the 7 phase (compound) interface 
in equilibrium with a is considered. Since A atoms lost from a are incorpo¬ 
rated into 7 , the shaded areas shown are equal. With respect to the interface 



Figure 8-8. Depiction of intermediate compound formation in an A-B diffusion 
couple. Reaction temperature is dotted in on phase diagram. 
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moving with velocity V , the following mass fluxes of A must be considered: 

I dC a \ 

flux into interface = CV- D\ - 

\ dx I i nt 

flux away from interface = C y V . 

These fluxes remain balanced for all times, so by equating them we have 

dX D a {dCJdx) 


V = 


dt 


c„- a 


( 8 - 8 ) 


where X is the compound layer thickness. From the simple geometric 
construction shown, dC a /dx can be approximated by (C A - C a )/L 0 . There¬ 
fore as growth proceeds, L 0 increases while V decreases. If the shaded area 
within the a phase can be approximated by (1/2)L 0 (C A - C a ), and this is set 
equal to C y X, then L 0 = 2 C y X/(C x - C a ). Substituting for dC a /dx in 
Eq. 8-8 leads to 


dX = D , (C A -CJ 2 
dt IX (C„ - C y )C y ‘ 


(8-9) 


Upon integrating, the a-y portion of the compound layer thickness is obtained 
as 


X = 


[^(c a -cj 2 ] 1/2 /' /2 

l(C a -C y )C y ] l/2 


( 8 - 10 ) 


A similar expression holds for the fi-y interface, and both solutions can be 
added together to yield the final compound layer thickness X y ; i.e., 

X y = const / I/2 . (8-11) 

The important feature to note is that parabolic growth kinetics is predicted. 
Thermally activated growth is also anticipated, but with an effective activation 
energy dependent on an admixture of diffusion parameters from both a and 0 
phases. 

Among the important and extensively studied thin-film compound-forming 
systems are Al-Au (used in interconnection /contact metallurgy) and 
metal-silicon (used as contacts to Si and Si0 2 ); they will be treated later in the 
chapter. Parabolic growth kinetics is almost always observed in these systems. 
When diffusion is sufficiently rapid; however, growth may be limited by the 
speed of interfacial reaction. Linear kinetics varying simply as t then ensue, 
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but only for short times. For longer times linear growth gives way to 
diffusion-controlled parabolic growth. 


8.2.4. The Kirkendall Effect 

The Kirkendall effect has served to illuminate a number of issues concerning 
solid-state diffusion. One of its great successes is the unambiguous identifica¬ 
tion of vacancy motion as the operative atomic transport mechanism during 
interdiffusion in binary alloy systems. The Kirkendall experiment requires a 
diffusion couple with small inert markers located within the diffusion zone 
between the two involved migrating atomic species. An illustration of what 
happens to the marker during thin-film silicide formation is shown in Fig. 8-9. 
Assuming that metal (M) atoms exchange sites more readily with vacancies 
than do Si atoms, more M than Si atoms will sweep past the marker. In effect, 
more of the lattice will move toward the left! To avoid lattice stress or void 
generation, the marker responds by shifting as a whole toward the right. The 
reverse is true if Si is the dominant migrating specie. Such marker motion has 
indeed been observed in an elegant experiment (Ref. 16) employing RBS 
methods to analyze the reaction between a thin Ni film and a Si wafer. 
Implanted Xe, which served as the inert maker, moved toward the surface of 


SILICIDE 



SILICIDE 


SILICIDE 




MARKER 
Si DIFFUSION 


Figure 8-9. Schematic of Kirkendall marker motion during silicide formation. (Re¬ 
printed with permission from John Wiley and Sons, from J. M. Poate, K. N. Tu and J. 
W. Mayer, eds., Thin Films: Interdiffusion and Reactions , Copyright © 1978, John 
Wiley and Sons). 
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the couple during formation of Ni 2 Si. The interpretation, therefore, is that Ni 
is the dominant diffusing specie. 

8.2.5. Diffusion Size Effects (Ref. 17) 

A linear theory of diffusion has been utilized to describe the various transport 
effects we have considered to this point and, except for this section, will be 
assumed for the remainder of the thin-film applications in this book. The 
macroscopic Fick diffusion equations defined by Eqs. 1-21 and 1-24 suffice as 
an operating definition of what is meant by linear diffusion theory. There are, 
however, nonlinear diffusion effects that may arise in thin-film structures when 
relatively large composition changes occur over very small distances (e.g., 
superlattices). To understand nonlinear effects, we reconsider atomic diffusion 
between neighboring planes in the presence of a free-energy gradient driving 
force. As a convenient starting point, two pertinent equations (1-33 and 1-35) 
describing this motion are reproduced here: 


G d AG 

r„ = 2 vexp - — sinh—, 

( 8 - 12 ) 

v = DF/RT. 

(8-13) 

An expansion of sinh(AG/ RT) yields 


AG AG 1 / AG \ 3 1 / AG \ 5 

sinh- = - + — |-1 + — I-1 + • • • . 

RT RT 3! \ RT) 5\ \ RT ) 

(8-14) 

Under conditions where AG jRT < 1, the higher-order terms are small com¬ 
pared with the first, which is the source of the linear effects expressed by the 


Nemst-Einstein equation. Linear behavior is common because the lattice 
cannot normally support large energy gradients. 

Now consider nonideal concentrated alloys where the free energies per atom 
or chemical potentials, ji, , at nearby planes 1 and 2 are defined by (Eq. 1-9) 

= /x° + kT In flj = /a° + kT In y x C x , 

M 2 = A * 0 + kT In a 2 = + kT In y 2 C 2 * (8-15) 

Here /x = G/N A , N A is Avogadro’s number, the activity a is defined by the 

product of the activity coefficient y and concentration C, and /x° is the 
chemical potential of the specie in the standard state. The force on an atom (/) 
is defined by the negative spatial derivative of /x: 

dfx \k 2 - /x, 
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and the force on a mole of atoms is 




If F is also defined as 2 AG/a 0 (Eq. 1-35), then 

AG a 0 F ln(72^2 /7 i^m) 

~RT ~ 2 RT “ TNf 


(8-16) 


where N t is the number of lattice spacings (a 0 ) included between planes 1 and 
2. The ratio 7 2 C 2 /y X C X typically ranges from 10 to 10 3 . In conventional thin 
films, N t > 100, so AG/RT is small compared with unity. Only the first 
term in the expansion of sinh(AG//?T) need be retained, which, as noted 
earlier, defines diffusion in the linear range. 

Imagine now what happens when N t is about 5 to 10 so that film dimensions 
of only 10-20 A are involved. At the highest values of y 2 C 2 /y x C x , the 
quantity AG/RT is approximately unity. The higher-order terms in Eq. 8-14 
can no longer be neglected now, and this leads to the nonlinear region of 
chemical diffusion. If only the cubic term is retained, a combination of Eqs. 
8-12 to 8-14 and 1-33 to 1-34 yields 


RT\ 24 (RT) 2 ) 


(8-17) 


rather than the Nemst-Einstein relation. For an ideal solution, fi = kT In C, 
and therefore / = (- kT / C)(dC/dx). The macroscopic mass flux J is given 
by the product of atomic concentration and velocity, and accordingly 


J = Cv 



J<_{dC \ 3 
24 C 2 \ dx J 


(8-18) 


The first term is the readily recognizable Fickian flux, which is the source of 
linear diffusion effects. By taking the negative divergence of the flux (Eqs. 
1-23 and 1-24), we obtain 


dC d 2 C Da\ ldC\ 2 d 2 C 
dt ^ dx 2 + 8 C 2 \ dx / dx 2 


(8-19) 


This fifth-degree nonlinear differential equation was developed by Tu (Ref. 
17), who termed it the “kinetic nonlinear” equation. It should not be confused 
with a similar “thermodynamic nonlinear” equation containing a d A C/dx A 
term used to describe diffusion in compositionally modulated films of small 
layer thickness. 

As a simple example illustrating diffusion size effects, consider a composite 
film structure with layers d thick. Initially, the solute concentration varies 
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sinusoidally above an average level C 0 as 
C(x,0) = C 0 + C/Sin (irx/rf) 


(C, = constant). 


( 8 - 20 ) 


If the film layer is thick, linear diffusion is expected. The solution that satisfies 
the Fick diffusion equation under the given conditions is 

7 r 2 Dt 


7T X 


C(x, t) = C 0 + C,sin— exp - 

d 


( 8 - 21 ) 


When heated, compositional gradients throughout the film structure are re¬ 
duced through interdiffusion. A measure of the extent of homogenization is 

(C(d/2, t) - C 0 )/C, = e~^ D,ld \ (8-22) 

As the film thickness shrinks, homogenization occurs more rapidly since 
diffusion distances are reduced. On further size reduction, a point is reached 
where large free-energy gradients cause entrance into the regime of nonlinear 
diffusion. Now Eq. 8-19 applies, and the reader can easily show that Eq. 8-21 
is no longer a solution. Initially, compositional smoothing will be governed by 
the nonlinear term, and the actual kinetics and spatial concentration distribu¬ 
tion will therefore be rather complex. But after some homogenization, 
ln( 7 2 C 2 h\C x ) diminishes to the point where the linear diffusion term domi¬ 
nates the interdiffusion. An exponential decay of the profile amplitude should 
then characterize the long-time kinetics. 


8.3. Interdiffusion in Metal Alloy Films 


8.3.1. Reactions at a Solder Joint 

One of the best ways to appreciate the importance of interdiffusion effects in 
metal films is to consider the interfacial region between a fabricated Si chip 
and the solder joint that connects it to the outside world. This is shown in Fig. 
8-10, and, following Tu (Ref. 17), we note that the two levels of A1 
metalization interconnections, which contact the Si devices above, must also be 
bonded to the solder ball below. Anyone who has tried to solder A1 is 
acquainted with the difficulties involved. In this case, they are overcome by 
using an evaporated Cr-Cu-Au thin-film structure. Since the surface of A1 is 
easily oxidized, it is difficult to solder with the Pb-Sn alloy, so a Cu layer is 
introduced. The intention is to utilize the fast Cu-Sn reaction to form 
intermetallic compounds. However, Cu adheres poorly to oxidized A1 and Si0 2 
surfaces. Moreover, when molten, the relatively massive Pb-Sn consumes the 
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Figure 8-10. Schematic diagram of solder contact to AJ by means of the trimetal 
Cr-Cu-Au film metallization scheme. (Reprinted with permission from Ref. 17). 

Cu and then tends to dewet on the Al surface. Therefore, Cr is introduced as a 
glue layer between the Al and Si0 2 and to prevent the molten solder from 
dewetting. The Cu surface needs to be protected against atmospheric corrosion 
because corroded Cu surfaces do not solder well. Therefore, a thin film of Au 
is introduced to passivate the Cu surface. Since Au dissolves rapidly in Pb and 
Sn, a solder richer in Pb than the eutectic composition is used. This allows for 
enough Pb to dissolve Au and sufficient Sn to react with Cu. The reaction 
between the excess Pb and Cu is limited because these elements do not form 
extended solid solutions or intermetallic compounds. Thus, in order to fulfill 
the functions of adhesion, soldering, and passivation, this elaborate trimetal 
film structure is required. 

There are additional solid-state diffusion effects between metal layers to 
contend with. At temperatures close to 200 °C, Cu can diffuse rapidly through 
GBs of Cr even though these metals are basically immiscible. When this 
happens, the interfacial adhesion at the Cr-Si0 2 interface is adversely af¬ 
fected. Moreover, Cu can diffuse outward through the Au in which it is 
miscible. At the Au surface it forms an oxide that interferes with soldering. 
This one method of joining (the flip-chip technology) has generated a host of 
mass-transport phenomena. For this reason, there has been considerable inter¬ 
est in thin-film interdiffusion studies of Cr-Cu, Au-Cu, and Cu-Sn systems. 
Other binary combinations from this group of involved elements, such as 
Al-Au and Al-Cu, have received even more attention. Miscible, immiscible, 
as well as compound-forming systems are represented. For the most part, 
vacancy diffusion is the accepted diffusion mechanism. However, in the case 
of the noble metals Cu and Au in the group IV matrices of Sn and Pb, 
anomalously rapid migration through interstitial sites is believed to occur. 
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8.3.2. GB Diffusion in Alloy Films 

There have been a considerable number of fundamental interdiffusion studies 
in metal alloy films that have been interpreted in terms of GB transport 
models. In order to exclude the complicating effects of compounds and 
precipitates, the systems selected were primarily limited to those displaying 
solid solubility. Results from a representative group of investigations are 
entered in Table 8-1, where the diffusivities are expressed in terms of the 
pre-exponential factor ( 8D b ) and activation energy. 

Two broad categories of experimental techniques were employed in gather¬ 
ing these data. Sputter sectioning through the film is the basis of the first 
technique. In the commonly employed AES depth-profiling method, the film is 
analyzed continuously as it is simultaneously thinned by sputter etching. 
Profiles appear similar to those shown in Fig. 8-6a, b. 

The second category is based on permeation and surface accumulation 
techniques. An example shown in Fig. 8-1 la utilizes AES signal-sensing 
methods to detect Ag penetration through a Au film and subsequent spreading 
along the exit surface (Ref. 19). The signal reflects this by building slowly at 
first after an incubation period, then changing more rapidly and finally 
saturating at long times. Diffusivities are then unfolded by fitting data to an 
assumed kinetics (shown in Fig. 8-1 la) of concentration buildup with time. 
When type C kinetics (Fig. 8-5) prevails, D b values may be simply estimated 
by equating the film thickness to the GB diffusion length — 2 y/D b t , where t 


Table 8-1. Grain-Boundary Diffusion in Thin Metal Films 


Diffusant 

Matrix 

5D b (cm 3 /sec) 

E„ (eV) 

Remarks 

A1 

Cu 

5.1 x Hr 7 

0.94 

Polycrystalline films 

i95 Au 

Au 

9.0 x 10- 10 

1.00 

Polycrystalline films 

i95 Au 

Au 

1.9 x 10- 10 

1.16 

(100) epitaxial 

Cu 

A1 

4.5 x 10" 8 

1.00 

Poly crystalline 



1.8 x 10- 10 

0.87 

films 

Ag 

Au 

4.5 x 10- 12 

1.20 

(111) epitaxial 

,95 Au 

Ni-0.5%Co 

1.4 x 10- 10 

1.60 

Polycrystalline films 

Sn 

Pb 

6.9 x 10- 10 

0.62 

Polycrystalline films 

Cr 

Au 

5.0 x 10-" 

1.09 

Textured film 

Ag 

Cu 

1.5 x 10- 13 

0.75 

Polycrystalline films 

Pt 

Cr 

5.0 x 10- 10 

1.69 

Polycrystalline films 

Cu 

Al-0.2%Co 

2.0 x 10“ 7 

0.56 

Polycrystalline films 

Ag 

Au 

5.0 x 10 14 

1.10 

(100) epitaxial 

ll9 Sn 

Sn 

5.0 x 10“ 8 

0.42 

Poly crystal line films 


From Ref. 18, 
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is the time required for the AES signal to appear. Interestingly, a grain¬ 
boundary Kirkendall effect shown in Fig. 8-1 lb has been suggested to account 
for the unequal GB diffusion rates of Ag and Au. 

The high sensitivity inherent in detecting a signal rise from the low back¬ 
ground enables very small atomic fluxes to be measured. This makes it possible 
to monitor transport at quite low temperatures. On the other hand, disadvan¬ 
tages include the electron-beam heating of films during measurement, and the 
need to maintain ultrahigh vacuum conditions because of the sensitivity of 
surface diffusion to ambient contamination. 

Results for the sectioning methods suggest two GB diffusion regimes: 

1. D b = 0.3 exp - 1.5T m /T cm 2 /sec for large-angle boundaries where E h = 
0.5 E l . 

2. D h = 2 exp - 12.57^/T cm 2 /sec for epitaxial films containing subbound¬ 
aries or dissociated dislocations where E h = 0.7 E L . 



Figure 8-1 la. Accumulation (C^) of Ag on Au film surfaces as a function of 
normalized (reduced) time for different diffusional annealing temperatures. Constants S 
and / 0 (incubation time) depend on the run involved. (From Ref. 19). 
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SOURCE SURFACE 

Figure 8-11b. Model of grain-boundary Kirkendall effect. (From Ref. 19). 


For the permeation and surface accumulation methods, the activation energies 
appear to be somewhat lower. Data derived from these techniques are more 
susceptible to GB structure, since diffusant is transported through the most 
highly disordered boundaries first. 


8.3.3. Formation of Intermetallic Compound Films 

Perhaps the best-known example of intermetallic compound formation between 
metal films involves the interaction between A1 and Au. For at least 20 years 
the combination of A1 films and Au wires or balls served to satisfy bonding 
requirements in the semiconductor industry, but not without a significant 
number of reliability problems. When devices are heated to 250-300 °C for a 
few days, an Al-Au reaction proceeds to form a porous intermetallic phase 
around the Au ball bond accompanied by a lacy network of missing Al. One of 
the alloy phases, AuA1 2 , is purple, which accounts for the appellation “purple 
plague/' Even though the evidence is not definitive that the presence of AuA1 2 
correlates with bond embrittlement, lack of strength, or degradation, formation 
of this compound is viewed with concern. 

An extensive RBS study (Ref. 20) of compound formation in the Al-Au 
thin-film system is worth reviewing since it clarifies the role of film thickness. 
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Figure 8-12. RBS spectrum showing formation of AuA 1 2 and Au 2 A1 phases at 
230 °C. (From Ref. 20). 

temperature and time in influencing the reaction. Film couples with A1 on top 
of Au were annealed, yielding RBS spectra such as shown in Fig. 8-12. 
Because there is such a large mass difference between A1 and Au, there is no 
peak overlap despite the unfavorable spatial ordering of the metal layers. The 
stoichiometry of the peak shoulders in the annealed films corresponds to the 
compounds Au 2 Al and AuAl 2 , and the high-energy tail is indicative of Au 
transport through the A1 film to the surface. From the fact that the AuAl 2 
always appears at a higher energy than the Au 2 Al, we know that it is closer to 
the air surface of the couple. By the methods outlined in Section 6.4.7, the 
compound thickness can be determined, and Fig. 8-13a reveals that both 
compounds grow with parabolic kinetics. The slopes of the compound thick- 
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Figure 8-13a. Compound film thickness versus square root of time for AuA 1 2 and 
Au 2 A1 at different annealing temperatures. 



Figure 8-13b. Arrhenius plots for the kinetics of formation of AuA 1 2 and Au 2 A1 
compounds. (From Ref. 20). 

ness-time 1//2 curves are proportional to the ubiquitous Boltzmann factor. 
Therefore, by plotting these slopes (actually the logs of the square of the slope 
in this case) versus 1/7" K in the usual Arrhenius manner (Fig. 8-13b), we 
obtain activation energies for compound growth. The values of 1.03 and 1.2 
eV can be roughly compared with the systematics given for FCC metals to 
elicit some clue as to the mass-transport mechanism for compound formation. 
Based on Au, these energies translate into equivalent Boltzmann factors of 
exp - 8.97 ^/T and exp — 10.47^/ T y respectively, suggesting a GB-as- 
sisted diffusion mechanism. Lastly, it is interesting to note how the sequence of 
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END PHASES 


Figure 8-14. Schematic diagrams illustrating compound formation sequence in 
Al-Au thin film couples. End phases depend on whether d A] > d Au or d A{i > d M . 

compound formation (Fig. 8-14) correlates with the equilibrium phase diagram 
(not shown). When the film thickness of A1 exceeds that of Au, then the latter 
will be totally consumed, leaving excess Al. The observed equilibrium between 
AJ and AuAl 2 layers is consistent with the phase diagram. Similarly, excess 
Au is predicted to finally equilibrate with the Au 4 A1 phase, as observed. 

8 . 4 . Electromigration in Thin Films 

Electromigration, a phenomenon not unlike electrolysis, involves the migration 
of metal atoms along the length of metallic conductors carrying large direct 
current densities. It was observed in liquid metal alloys well over a century ago 
and is a mechanism responsible for failure of tungsten light-bulb filaments. 
Bulk metals approach the melting point when powered with current densities 
( J ) of about 10 4 A/cm 2 . On the other hand, thin films can tolerate densities of 
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(b) 


Figure 8-15. Manifestations of electromigration damage in A1 films: (a) hillock 
growth, (from Ref. 21, courtesy of L. Berenbaum); (b) whisker bridging two conductors 
(courtesy of R. Knoell, AT & T Bell Laboratories); (c) nearby mass accumulation and 
depletion (courtesy S. Vaidya, AT & T Bell Laboratories). 
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(c) 

Figure 8-15. Continued. 


10 6 A/cm 2 without immediate melting or open-circuiting because the Joule 
heat is effectively conducted away by the substrate, which behaves as a 
massive heat sink. In a circuit chip containing some 100,000 devices, there is a 
total of several meters of polycrystalline A1 alloy interconnect stripes that are 
typically less than 1.5 /an wide and 1 fim thick. Under powering, at high 
current densities, mass-transport effects are manifested by void formation, 
mass pileups and hillocks, cracked dielectric film overlayers, grain-boundary 
grooving, localized heating, and thinning along the conductor stripe and near 
contacts. Several examples of such film degradation processes are shown in 
Fig. 8-15. In bootstrap fashion the damage accelerates to the point where 
open-circuiting terminates the life of the conductor. It is for these reasons that 
electromigration has been recognized as a major reliability problem in inte¬ 
grated circuit metallizations for the past quarter century. Indeed, there is some 
truth to a corollary of one of Murphy's laws—“A million-dollar computer will 
protect a 25-cent fuse by blowing first.” Analysis of the extensive accelerated 
testing that has been performed on interconnections has led to a general 
relationship between film mean time to failure (MTF) and J given by 

MTF“ 1 = A:(exp - EJkT)J\ (8-23) 

As with virtually all mass-transport-related reliability problems, damage is 
thermally activated. For A1 conductors, n is typically 2 to 3, and E e , the 
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Figure 8-16. (a) Atomic model of electromigration involving electron momentum 

transfer to metal ion cores during current flow, (b) Model of electromigration damage 
in a powered Film stripe. Mass flux divergences arise from nonuniform grain structure 
and temperature gradients. 

activation energy for electromigration failure, ranges from 0.5 to 0.8 eV, 
depending on grain size. In contrast, an energy of 1.4 eV is associated with 
bulk lattice diffusion so that low-temperature electromigration in films is 
clearly dominated by GB transport. The constant K depends on film structure 
and processing. Current design rules recommend no more than 10 5 A/cm 2 for 
stripe widths of — 1.5 pm. Although Eq. 8-23 is useful in designing metaliza- 
tions, it provides little insight into the atomistic processes involved. 

The mechanism of the interaction between the current carriers and migrating 
atoms is not entirely understood, but it is generally accepted that electrons 
streaming through the conductor are continuously scattered by lattice defects. 
At high enough current densities, sufficient electron momentum is imparted to 
atoms to physically propel them into activated configurations and then toward 
the anode as shown in Fig. 8-16. This electron “wind” force is oppositely 
directed to and normally exceeds the well-shielded electrostatic force on atom 
cores arising from the applied electric field £. Therefore, a net force F acts on 
the ions, given by 

F= Z*q£ = Z*qpJ , (8-24) 

where q is the electronic charge and £ is, in turn, given by the product of the 
electrical resistivity of the metal, p, and J. An “effective” ion valence Z* 
may be defined, and for electron conductors it is negative in sign with a 
magnitude usually measured to be far in excess of typical chemical valences. 
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On a macroscopic level, the observed mass-transport flux, J m , for an element 
of concentration C is given by 

J m = Cv = CDZ*q pJ/RT, (8-25) 

where use has, once again, been made of the Nemst-Einstein relation. 
Electromigration is thus characterized at a fundamental level by the terms Z* 
and D. Although considerable variation in Z* exists, values of the activation 
energy for electrotransport in films usually reflect a grain-boundary diffusion 
mechanism. 

Film damage is caused by a depletion or accumulation of atoms, which is 
defined by either a negative or positive value of dC/dt, respectively. By 
Eq. 1-23, 


dC d I CDZ*qpJ \ d (CDZ*qpJ\dT 

"a7 =_ 3A : ( rt /arl rt Ja*' ^ 8 " 26 ^ 

The first term on the right-hand side reflects the isothermal, structurally 
induced mass flux divergence, and the second term represents mass transport 
in the presence of a temperature gradient. The resulting transport under these 
distinct conditions can be qualitatively understood with reference to Fig. 
8-16b, assuming that atom migration is solely confined to GBs and directed 
toward the anode. Let us first consider electromigration under isothermal 
conditions. Because of varying grain size and orientation distributions, local 
mass flux divergences exist throughout the film. Each cross section of the 
stripe contains a lesser or greater number of effective GB transport channels. If 
more atoms enter a region such as a junction of grains than leave it, a mass 
pileup or growth can be expected. A void develops when the reverse is true. At 
highly heterogeneous sites where, for example, a single grain extends across 
the stripe width and abuts numerous smaller grains, the mass accumulations 
and depletions are exaggerated. For this reason, a uniform distribution of grain 
size is desirable. Of course, single-crystal films would make ideal interconnec¬ 
tions because the source of damage sites is eliminated, but it is not practical to 
deposit them. 

Electromigration frequently occurs in the presence of nonuniform tempera¬ 
ture distributions that develop at various sites within device structures—e.g., at 
locations of poor film adhesion, in regions of different thermal conductivity, 
such as metal-semiconductor contacts or interconnect-dielectric crossovers, at 
nonuniformly covered steps, and at terminals of increased cross section. In 
addition to the influence of microstructure, there is the added complication of 
the temperature gradient. The resulting damage pattern can be understood by 
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considering the second term on the right-hand side of Eq. 8-26. For the 
polarity shown, all terms in parentheses are positive and CqpJ/RT is roughly 
temperature independent, whereas DZ* increases with temperature. There¬ 
fore, dC/dt varies as - dT/dx . Voids will thus form at the negative 
electrode, where dT/ dx > 0, and hillocks will grow at the positive electrode, 
where dT/dx <0. Physically, the drift velocity of atoms at the cathode 
increases as they experience a rising temperature. More atoms then exit the 
region than flow into it. At the anode the atoms decelerate in experiencing 
lower temperatures and thus pile up there. An analogy to this situation is a 
narrow strip of road leading into a wide highway (at the cathode). The 
bottleneck is relieved and the intercar spacing increases. If further down the 
highway it again narrows to a road, a new bottleneck reforms and cars will pile 
up (at the anode). 

Despite considerable efforts to develop alternative interconnect materials, 
Al-base alloys are still universally employed in the industry. Their high 
conductivity, good adhesion, ease of deposition, etchability, and compatibility 
with other processing steps offset the disadvantages of being prone to corrosion 
and electromigration degradation. Nevertheless, attempts to improve the qual¬ 
ity of A1 metallizations have prompted the use of alternative deposition 
methods as well as the development of more electromigration-resistant alloys. 
With regard to the latter, it has been observed that A1 alloyed with a few per¬ 
cent Cu can extend the electromigration life by perhaps an order of magni¬ 
tude relative to pure Al. Reasons for this are not completely understood, but it 
appears that Cu reduces the GB migration of the solvent Al. The higher values 
for E b which are observed are consistent with such an interpretation. Other 
schemes proposed for minimizing electromigration damage have included 

1. Dielectric film encapsulation to suppress free surface growths 

2. Incorporation of oxygen to generally strengthen the matrix through disper¬ 
sion of deformation-resistant A1 2 0 3 particles 

3. Deposition of intervening thin metal layers in a sandwichlike structure that 
can shunt the Al in case it fails 

The future may hold some surprises with respect to electromigration life¬ 
time. Experimental results shown in Fig. 8-17 reveal reduction of film life as 
the linewidth decreases from 4 to 2 fxm in accord with intuitive expectations. 
However, an encouraging increase in lifetime is surprisingly observed for 
submicron-wide stripes. The reason for this is the development of a bamboo¬ 
like grain structure generated in electron-beam evaporated films. Because the 
GBs are oriented normal to the current flow, the stripe effectively behaves as a 
single crystal. Similar benefits are not as pronounced in sputtered films. 
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Figure 8-17. Mean time to failure as a function of stripe linewidth for evaporated 
(E-gun) and sputtered (S-gun, In-S) A1 films. (From Ref. 22). 


8.5. Metal-Semiconductor Reactions 


8.5.1. Introduction to Contacts 

All semiconductor devices and integrated circuits require contacts to connect 
them to other devices and components. When a metal contacts a semiconductor 
surface, two types of electrical behavior can be distinguished in response to an 
applied voltage. In the first type, the contact behaves like a P-N junction and 
rectifies current. The ohmic contact, on the other hand, passes current equally 
as a function of voltage polarity. In Section 10.4 the electrical properties of 
metal-semiconductor contacts will be treated in more detail. 

Contact technology has dramatically evolved since the first practical semi¬ 
conductor device, the point-contact rectifier, which employed a metal whisker 
that was physically pressed into the semiconductor surface. Today, deposited 
thin films of metals and metal compounds are used, and the choice is dictated 
by complex considerations; not the least of these is the problem of contact 
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P-Si 



Figure 8-18 . Schematic diagrams of silicide contacts in (a) bipolar and (b) MOS field 
effect transistor configurations. (Reprinted with permission from Ref. 17, © 1985 
Annual Reviews Inc.). 

instability during processing caused by mass-transport effects. For this reason, 
elaborate film structures are required to fulfill the electrical specifications and 
simultaneously defend against contact degradation. The extent of the problem 
can be appreciated with reference to Fig. 8-18, where both bipolar and MOS 
field effect transistors are schematically depicted. The operation of these 
devices need not concern us. What is of interest are the reasons for the Cr and 
metal silicide films that serve to electrically connect the Si below to the Al-Cu 
metal interconnections above. These bilayer structures have replaced the more 
obvious direct AJ-Si contact, which, however, continues to be used in other 
applications. Contact reactions between A1 and Si are interesting metallurgi¬ 
cal^ and provide a good pedagogical vehicle for applying previously devel¬ 
oped concepts of mass transport. A discussion of this follows. Means of 
minimizing Al-Si reactions through intervening metal silicide and diffusion- 
barrier films will then be reviewed. 
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8.5.2. Al - Si Reactions 

Nature has endowed us with two remarkable elements: Al and Si. Together 
with oxygen, they are the most abundant elements on earth. It was their destiny 
to be brought together in the minutest of quantities to make the computer age 
possible. Individually, each element is uniquely suited to perform its intended 
function in a device, but together they combine to form unstable contacts. In 
addition to creating either a rectifying barrier or ohmic contact, they form a 
diffusion couple where the extent of reaction is determined by the phase 
diagram and mass-transport kinetics. The processing of deposited Al films for 
contacts typically includes a 400 °C heat treatment. This enables the Al to 
reduce the very thin native insulating Si0 2 film and “sinter” to Si, thereby 
lowering the contact resistance. Reference to the Al-Si phase diagram (Fig. 
1-13) shows that at this temperature Si dissolves in Al to the extent of about 
0.3 wt%. During sintering, Si from the substrate diffuses into the Al via GB 
paths in order to satisfy the solubility requirement. Simultaneously, Al mi¬ 
grates into the Si by diffusion in the opposite direction. As shown by the 
sequence of events in Fig. 8-19, local diffusion couples are first activated at 
several sites within the contact area. When enough Al penetrates at one point, 
the underlying P-N junction is shorted by a conducting metal filament, and 
junction “spiking” or “spearing” is said to occur. 


Figure 8-19. 

tion spiking. 
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The remedy for the problem seems simple enough. By presaturating the A1 
with Si the driving force for interdiffusion disappears. Usually a 1 wt% Si-Al 
alloy film is sputtered for this purpose. However, with processing another 
complication arises. During the heating and cooling cycle Si is first held in 
solid solution but then precipitates out into the GBs of the A1 as the latter 
becomes supersaturated with Si at low temperatures. The irregularly shaped Si 
precipitate particles, saturated with Al, grow epitaxially on the Si substrate. 
Electrically these particles are P type and alter the intended electrical charac¬ 
teristics of the contact. Thus, despite ease in processing, Al contact metallurgy 
is too unreliable in the VLSI regime of very shallow junction depths. For this 
reason, noble metal silicides such as Pd-Si have largely replaced Al at 
contacts. 

There is yet another example of Al-Si reaction that occurs in field effect 
transistors. In this case, however, the contact to the gate oxide (Si0 2 ), rather 
than to the semiconductor source and drain regions, is involved. Historically, 
Al films were first used as gate electrodes, but, as noted on p. 24, they tend to 
reduce Si0 2 , which is undesirable. Other metals are also problematical 



Al > Si 





(d) 



Figure 8-20. Depiction of reactions between Al and polysilicon films during anneal¬ 
ing. Figures a, b, c refer to the case where ^ Sj > d A] . Figures d, e, f refer to case 
where d M > d Sl . (From Ref. 23). 
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because of the potential reaction to form a silicide as well as oxide; an example 
is 3Ti + 2Si0 2 -+ TiSi 2 + 2Ti0 2 . For reliable device performance, the fore¬ 
going considerations have led to the adoption of poly-Si films as the gate 
electrode. Although there is now no driving force promoting reaction between 
Si and Si0 2 , the chronic problem of Si-Al interdiffusion has re-emerged. The 
A1 interconnections must still make contact to the gate electrode. To make 
matters worse, reaction of A1 with poly Si is even more rapid than with 
single-crystal Si because of the presence of GBs. The dramatic alteration in the 
structure and composition in the Al-poly-Si-layered films following thermal 
treatment is shown schematically in Fig. 8-20. Reactions similar to those 
previously described for the Al-Si contact occur, and resultant changes are 
sensitive to the ratio of film thicknesses. It is easy to see why electrical 
properties would also be affected. Therefore, intervening silicide films and 
diffusion barriers must once again be relied on to separate A1 from Si. 


8.6. SlLICIDES AND DIFFUSION BARRIERS 


8.6.1. Metal Silicides 

In the course of developing silicides for use in contact applications, a great deal 
of fundamental research has been conducted on the reactions between thin 
metal films and single-crystal Si. Among the issues and questions addressed by 
these investigations are the following: 

1. Which silicide compounds form? 

2. What is the time and temperature dependence of metal silicide formation? 

3. What atomic mass-transport mechanisms are operative during silicide for¬ 
mation? Which of the two diffusing species migrates more rapidly? 

4. When the phase diagram indicates a number of different stable silicide 
compounds, which form preferentially and in what reaction sequence? 

Virtually all thin-film characterization and measurement tools have been 
employed at one time or another in studying these aspects of silicide formation. 
In particular, RBS methods have probably played the major role in shaping our 
understanding of metal-silicon reactions by revealing compound stoichiome¬ 
tries, layer thicknesses, and the moving specie. Examples of the spectra 
obtained and their interpretation have been discussed previously. (See Sec¬ 
tion 6.4.7). 

A summary of kinetic data obtained in silicide compounds formed with 
near-noble, transition, and refractory metals is contained in Table 8-2. This 
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Table 8-2, Silicide Formation 



Formation 

Activation 



Formation 


Temperature 

Energy 

Growth 

Moving 

Energy at 298 K 

Silicide 

(°C) 

(eV) 

Rate 

Specie 

(kcal/mole) 

Au 2 Si 

100 





Co 2 Si 

350-500 

1.5 

,*/2 

Co 

-27.6 

Ni 2 Si 

200-350 

1.5 

,1/2 

Ni 

-33.5 

NiSi 

350-700 

1.4 

t 


-20.5 

Pt 2 Si 

200-500 

1.5 

,1/2 

Pt 

-20.7 

PtSi 

300 

1.6 

,1/2 


-15.8 

FeSi 

450-550 

1.7 

,1/2 

Si 

- 19.2 

RhSi 

350-425 

1.95 

,1/2 

Si 

- 16.2 

HfSi 

550-700 

2.5 

,1/2 

Si 

-34 

IrSi 

400-500 

1.9 

,!/2 


-16.2 

CrSi 2 

450 

1.7 

t 


-28.8 

MoSi 2 

525 

3.2 

t 

Si 

-31.4 

WSi 2 

650 

3.0 


Si 

-22.2 

From Refs. 12 and 24. 





large body of work can be summarized 

in the following way: 


Silicide 

Formation Temp. (°C) 

Growth Rate 

Activation Energy (eV) 

M 2 Si 


200 

,1/2 

1.5 


MSi 


400 

,1/2 

1.6 

-2.5 

MSi 2 


600 

t 1 /2 

1.7 

-3.2 


Three broad classes of silicides are observed to form: the metal-rich silicide 
(e.g., M 2 Si), the monosilicide (MSi), and the silicon-rich silicide (e.g., 
MSi 2 ). As a rough rule of thumb, the formation temperature ranges from one 
third to one half the melting point (in K) of the corresponding silicide. Since 
fine-grained metal films are involved, it is not surprising that this rule is 
consistent with the GB diffusion regime. The activation energies roughly 
correlate with the melting point of the silicide, in agreement with general 
trends noted earlier. 

In the metal-rich silicides, the metal is observed to be the dominant mobile 
specie, whereas in the mono- and disilicides Si is the diffusing specie. The 
crucial step in silicide formation requires the continual supply of Si atoms 
through the breaking of bonds in the substrate. In the case of disilicides, high 
temperatures are available to free the Si for reaction. At lower temperatures 
there is insufficient thermal energy to cause breaking of Si bonds, and the 
metal-rich silicides thus probably form by a different mechanism. It has been 
suggested that rapid interstitial migration of metal through the Si lattice assists 
bond breaking and thus controls the formation of such silicides. 
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The sequence of phase formation has only been established in a few silicide 
systems. Perhaps the most extensively studied of these is the Ni-Si system, for 
which the phase diagram and compound formation map are provided in Fig. 
8-21. The map shows that Ni 2 Si is always the first phase to form during 
low-temperature annealing. Clearly, Ni 2 Si is not in thermodynamic equilib¬ 
rium with either Ni or Si, according to the phase diagram. What happens next 

ATOMIC PERCENT SILICON 
0 20 40 60 80 100 



Figure 8-21. Map of thin-film Ni silicide formation sequence. Phase diagram of 
Ni-Si system shown on top. (Reprinted with permission from Ref. 17, © 1985 Annual 
Reviews Inc.). 
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depends on whether Si or Ni is present in excess. In the usual former case, 
where a Ni thin film is deposited on a massive Si wafer, the sequence proceeds 
first to NiSi and then to NiSi 2 at elevated temperatures. However, when a film 
of Si is deposited on a thicker Ni substrate, then the second and third 
compounds become Ni 5 Si 2 and Ni 3 Si. At elevated temperatures the resultant 
two-phase equilibrium (i.e., Si—NiSi 2 or Ni-Ni 3 Si) conforms to the phase 
diagram. The question of the first silicide to form is a more complicated issue. 
It may be related to the ability to vapor-quench alloys to nucleate very thin, 
prior amorphous film layers. It is well known that bulk amorphous phases are 
readily formed by quenching metal-silicon eutectic melts. Therefore, it is 
suggested that silicide compounds located close to low-temperature eutectic 
compositions are the first to form. 

Interestingly, in bulk diffusion couples all compounds appear to grow 
simultaneously at elevated temperatures. This does not seem to happen in films 
(at low temperature), but more sensitive analytical techniques may be required 
to clarify this issue. 


8.6.2. Diffusion Barriers 

Diffusion barriers are thin-film layers used to prevent two materials from 
coming into direct contact in order to avoid reactions between them. Paint and 
electrodeposited layers are everyday examples of practical barriers employed 
to protect the underlying materials from atmospheric attack. In a similar vein, 
diffusion barriers are used in thin-film metallization systems, and the discus¬ 
sion will be limited to these applications. We have already noted the use of 
silicides to prevent direct Al-Si contact. Ideally, a barrier layer X sandwiched 
between A and B should possess the following attributes (Ref. 25): 

1. It should constitute a kinetic barrier to the traffic of A and B across it. In 
other words, the diffusivity of A and B in X should be small. 

2. It should be thermodynamically stable with respect to A and B at the highest 
temperature of use. Further, the solubility of X in A and B should be small. 

3. It should adhere well to and have low contact resistance with A and B and 
possess high electrical and thermal conductivity. Practical considerations 
also require low stress, ease of deposition, and compatibility with other 
processing. 

Some of these requirements are difficult to achieve and even mutually 
exclusive so that it is necessary to make compromises. 

A large number of materials have been investigated for use as barrier layers 
between silicon semiconductor devices and A1 interconnections. These include 
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Table 8-3. Aluminum-Diffusion-Barrier-Silicon Contact Reactions 


Diffusion 

Barrier 

Reaction 

Temperature 

(°C) 

Reaction 

Products 

Failure Mechansims 

Cr 

300 

Al 7 Cr 

C ( E c = 1.9 eV) 

V 

450 

A1 3 V, Al-V-Si 

C ( E c = 1.7 eV) 

Ti 

400 

Al 3 Ti 

C ( E c = 1.8 eV) 

Ti-W 

500 


D 

ZrN 

550 

Al-Zr-Si 

C 

PtSi 

350 

Al 2 Pt, Si 

C 

Pd 2 Si 

400 

Al 3 Pd, Si 

C 

NiSi 

400 

Al 3 Ni, Si 

C 

CoSi 2 

400 

A1 9 Co 2 , Si 

C 

TiSi 2 

550 

Al-Ti-Si 

D 

MoSi 2 

535 

A1 12 Mo, Si 

D 

Ti-Pd 2 Si 

435 

Al 3 Ti 

C 

W-CoSi 2 

500 

A1 i2 W 

C 

TiN-PtSi 

600 

AIN, Al 3 Ti 

C 

TiC-PtSi 

600 

A1 4 C 3 , Al 3 Ti 

C 

TaN-NiSi 

600 

AIN, Al 3 Ta 

C 


C = Compound formation; D = Diffusion 
From Ref. 26. 


silicides, refractory metals, transition metal alloys, transition metal com¬ 
pounds, as well as dual-layer barriers such as refractory metal-silicide, 
transition metal-silicide and transition metal compound-silicide combinations 
(Ref. 26). A compilation of these materials and reaction products is given in 
Table 8-3. Stringent physical requirements and the complexity of low-tempera¬ 
ture interdiffusion and reactions have frequently necessitated the use of “diffu¬ 
sion barriers’ 7 to protect diffusion barriers. In order to gain a complete picture 
of the effectiveness of diffusion barriers, we need analytical techniques to 
reveal metallurgical interactions and their effect on the electrical properties of 
devices. For this reason, RBS measurements and, to a lesser extent, SIMS and 
AES depth profiling have been complemented by various methods for deter¬ 
mining barrier heights ($ B ) of contacts (Section 10.4). Changes in $ B are a 
sensitive indicator of low-temperature reactions at the metal-Si interface. 

To appreciate the choice of barrier materials, we first distinguish among 
three models that have been proposed for successful diffusion-barrier behavior 
(Ref. 25). 

L Stuffed Barriers . Stuffed barriers rely on the segregation of impurities 
along otherwise rapid diffusion paths such as GBs to block further passage of 
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two-way atomic traffic there. The marked improvement of sputtered Mo and 
Ti-W alloys as diffusion barriers when they contain small quantities of 
intentionally added N or O impurities is apparently due to this mechanism. 
Impurity concentrations of - 10" 1 to 10 -3 at% are typically required to 
decorate GBs and induce stuffed-barrier protection. In extending the electromi- 
gation life of Al, Cu may in effect “stuff’ the conductor GBs. 

2. Passive Compound Barriers . Ideal barrier behavior exhibiting chemical 
inertness and negligible mutual solubility and diffusivity is sometimes approxi¬ 
mated by compounds. Although there are numerous possibilities among the 
carbides, nitrides, borides, and even the more conductive oxides, only the 
transition metal nitrides, such as TiN, have been extensively explored for 
device applications. TiN has proved effective in solar cells as a diffusion 
barrier between N-Si and Ti-Ag, but contact resistances are higher than 
desired in high-current-density circuits. 

3. Sacrificial Barriers . A sacrificial barrier maintains the separation of A 
and B only for a limited duration. As shown in Fig. 8-22, sacrificial barriers 
exploit the fact that reactions between adjacent films in turn produce uniform 
layered compounds AX and BX that continue to be separated by a narrowing X 
barrier film. So long as X remains and compounds AX and BX possess 
adequate conductivity, this barrier is effective. The first recognized application 
of a sacrificial barrier involved Ti, which reacted with Si to form Ti 2 Si and 
with Al to form TiAl 3 . Judging from the many metal aluminide and occasional 
Al-metal-silicon compounds in Table 8-3, sacrificial barrier reactions appear 
to be quite common. 

If the reaction rate kinetics of both compounds, i.e., AX, BX, are known, 
then either the effective lifetime or the minimum thickness of barrier required 
may be predicted. The following example is particularly instructive (Ref. 1). 
Suppose we consider a Ti diffusion barrier between Si and Al. Without 
imposition of Ti, the Al-Si combination is unstable. The question is, how 



Figure 8-22. Model of sacrificial barrier behavior. A and B films react with barrier 
film X to form AX and BX compounds. Protection is afforded as long as X is not 
consumed. (Reprinted with permission from Elsevier Sequoia, S.A., from M.-A. 
Nicolet, Thin Solid Films 52, 415, 1978). 
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much Ti should be deposited to withstand a thermal anneal at 500 °C for 15 
min? At the A1 interface, TiAl 3 forms with parabolic kinetics given by 

rf TiAi 3 = (1.5 X 10 ,5 )<r 1 ' 85eV/ * 7 7 (A 2 ), (8-27) 

where cf TiAl3 is the thickness of the TiAl 3 layer and t is the time in seconds. 
Similarly, the reaction of Ti with Si results in the formation of TiSi 2 with a 
kinetics governed by 

d \ iSi2 = (5.74 X 10 9 )e" 1,3eV/ * r / (A 2 ). (8-28) 

o o 

For the specified annealing conditions, tf TiA , 3 = 1100 A and d riSl2 = 130 A. 
An insignificant amount of Ti is consumed under ambient operating conditions. 
Therefore, the minimum thickness of Ti required is the sum of these two 
values, or 1230 A. 

In conclusion, we note that semiconductor contacts are thermodynamically 
unstable because they are not in a state of minimum free energy. The 
imposition of a diffusion barrier slows down the equilibration process, but the 
instability is never actually removed. Enhanced reliability is bought with 
diffusion barriers, but at the cost of increasing structural complexity and added 
processing expense. 


8.7. Diffusion During Film Growth 


We close the chapter by considering diffusion effects in films growing within a 
gas-phase ambient. In addition to the diffusional exchange between gas atoms 
and growing film, or the redistribution of atoms between film and substrate, 
there is the added complexity of transport across a moving boundary. Such 
effects are important in high-temperature oxidation of Si, one of the most- 
studied film growth processes. The resulting amorphous Si0 2 films find 
extensive use in microelectronic applications as an insulator, and as a mask 
used to pattern and expose some regions for processing while shielding other 
areas. In contrast to film deposition, where the atoms of the deposit originate 
totally from the vapor phase (as in CVD of Si0 2 ), oxidation relies on the 
reaction between Si and oxygen to sustain oxide film growth. This means that 
for every 1000 A of Si0 2 growth, 440 A (i.e., 1000p SiO2 M Si /p s i^sio 2 ) °f Si 
substrate is consumed. The now-classic analysis of oxidation due to Grove 
(Ref. 27) has a simple elegance and yet accurately predicts the kinetics of 
thermal oxidation. In this treatment of the model, we assume a flow of gas 
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containing oxygen parallel to the plane of the Si surface. In order to form oxide 
at the Si-Si0 2 interface, the following sequential steps are assumed to occur: 

1. Oxygen is transported from the bulk of the gas phase to the gas-oxide 
interface. 

2. Oxygen diffuses through the growing solid oxide film of thickness d 0 . 

3. When oxygen reaches the Si-Si0 2 interface, it chemically reacts with Si 
and forms oxide. 


The respective mass fluxes corresponding to these steps can be expressed by 


J\ — ^c(Cg C 0 ), 

(8-29) 

J 2 = D(C 0 - C,)/d 0 , 

(8-30) 

J 3 = a : s c,, 

(8-31) 


where the concentrations of oxygen in the bulk of the gas, at the gas-Si0 2 
interface, and at the Si0 2 -Si interface are respectively, C c , C 0 , and C t . The 
quantities h G , D, and K s represent the gas mass-transport coefficient, the 
diffusion coefficient of oxygen in Si0 2 , and the chemical reaction rate 
constant, respectively. Constants D and K s display the usual Boltzmann 
behavior but with different activation energies, and h G has a weak temperature 
dependence. 

By assuming steady-state growth implying 7, = J 2 — » we easily solve 

for C { and C 0 in terms of C G : 


C G ( 1 + K s d 0 /D) 

1 +K s /h G + K s dJD' 


(8-32a) 


C, = 


1 + K s /h G + K s d 0 /D 


(8-32b) 


Clearly, the grown Si0 2 has a well-fixed stoichiometry so that C 0 and C, 
differ only slightly in magnitude, but sufficiently to establish the concentration 
gradient required for diffusion. In fact, C 0 = C, = C c /(1 + K s /h c ) in the 
so-called reaction-limited case where D > K s d 0 . Here, diffusion is assumed 
to be very rapid through the Si0 2 , but the bottleneck for growth is the 
interfacial chemical reaction. On the other hand, under diffusion control, D is 
small so that C 0 ~ C G and C, ~ 0. In this case the chemical reaction is 
sufficiently rapid, but the supply of oxygen is rate-limiting. The actual oxide 
growth rate is related to the flux, say / 3 , and therefore the thickness of oxide 
at any time is expressed by 

d(d 0 )/dt = K s Cj / N 0 , 


( 8 - 33 ) 
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where N 0 is the number of oxidant molecules incorporated into a unit volume 
of film. For oxidation in dry 0 2 gas, N 0 = 2.2 x 10 22 cm" 3 , whereas for 
steam (wet) oxidation N 0 = 4.4 X 10 22 cm -3 , because half as much oxygen is 
contained per molecule. 

Substitution of Eq. 8-32b into 8-33 and direct integration of the resulting 
differential equation yields 

d 2 0 + Ad 0 = B(t + t), (8-34) 

where 



2 DCn d 2 + Ad, 

B = - - , and r = —--. 

N 0 B 


The constant of integration r arises only if there is an initial oxide film of 
thickness </, present prior to oxidation, and therefore Eq. 8-34 is useful in 
describing sequential oxidations. A solution to this quadratic equation is 

d o r . t + r . 


= tl + 


A 2 /4B 


(8-35) 


from which the limiting long- as well as short-time growth kinetics relation¬ 
ships are easily shown to be 

d 2 * Bt for t > A 2 f4B, (8-36) 


d ° “ + 


for / + r ^ 


The reader will recall similar parabolic and linear growth in metal compound 
and silicide films. All film growth is probably linear to begin with because 
parabolic growth implies an infinite initial thickening rate. 

Values for the parabolic and linear rate constants for Si0 2 , grown from 
(111) Si, are approximately (Ref. 28) 

0.71 eV 

B - 186 exp-—— /im 2 /hr (wet0 2 ), (8-38a) 


B = 950 exp 


kT 

1.24 eV 


jim 2 /hr (dry 0 2 ), (8-38b) 


= 7.31 x 10 7 exp 


— = 5.89 x 10 6 exp 
A 


1.96 eV 
kT 
2.0 eV 


- jim/hr (wet0 2 ), (8-39a) 

jim/hr (dry 0 2 ), (8-39b) 


where all constants are normalized to 760 torr. Equations 8-36 and 8-37 serve 
as an aid in designing oxidation treatments. Different activation energies for B 
are obtained in wet and dry 0 2 because the migrating species in each case is 
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Figure 8-23. Oxidation kinetics behavior of Si in terms of dimensionless oxide 
thickness and time. The two limiting forms of the kinetics are shown. From A. S. Grove, 
Physics and Technology of Semiconductor Devices , Copyright © 1967, John Wiley and 
Sons. (Reprinted with permission.) 

different, e.g., H 2 0 and OH as opposed to 0 2 or O. However, the activation 
energy for Bf A reflects the chemical reaction at the Si-Si0 2 interface and is 
the same regardless of the nature of the oxygen-bearing diffusant. A single 
dimensionless thickness-time plot shown in Fig. 8-23 very neatly summarizes 
Si oxidation behavior. The limiting linear and parabolic growth kinetics 
regimes are clearly identified. 

Not all oxidation processes, however, display linear or parabolic growth 
kinetics. Some examples are presented in Chapter 12 in connection with 
protective oxide coatings. 



Exercises 



1. a. Establish generalized expressions for the lattice diffusivity as a func¬ 
tion of temperature for semiconductors and alkali halides, using 
Fig. 5-6. 
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b. How does your expression for D L compare with the diffusivity values 
for Si in Si (self-diffusion) in Fig. 8-3? 

2. A P-N junction is produced by diffusing B from a continuous source 
(C 0 = 10 19 cm -3 ) into an eptiaxial Si Film with a background N level of 
10 15 cm 3 . Diffusion is carried out at 1100 °C for 30 min. 

a. How far beneath the Si surface is the junction (i.e., where C N = C P ). 
Use Eq. l-27a. 

b. If there is a 1 % error in temperature, what is the percent change in 
junction depth? 

c. By what percent will the junction depth change for a 1 % change in 
diffusion time? 

3- At what temperature will the number of Au atoms transported through 
grain boundaries equal that which diffuses through the lattice if the grain 
size is 2 fim? 20 fim? 

4. The equation for transport of atoms down a single grain boundary where 
there is simultaneous diffusion into the adjoining grains is 

= _ 2 P L / dC L \ 

dt b dy 2 8 \ dx ) x= s /2 


Derive this equation by considering diffusional transport into and out of 
an element of grain boundary 6 wide and dy long. 

5- In the Pd-Au thin-film diffusion couple an approximate Fit to the data of 
Fig. 8-7 can be made employing the equation 


C(x, () = — erfc 


x 

2 VDt ' 


a. Plot C( x, 0 vs. x. 

b. From values of dC/dx | ^o, estimate the values of D for the 0-h, 
20-h, and 200-h data. Are these D values the same? 

c. What accounts for the apparent interdiffusion between Au and Pd at 0 
h? 

6 . a. Calculate the activation energy for dislocation pipe diffusion of Au in 
epitaxial Au Films from the data of Fig. 8-6a. 
b. Calculate the activation energy for grain-boundary diffusion of Au in 
polycrystalline Au Films from the data of Fig. 8-6b. 
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In both cases make Arrhenius plots of the diffusivity data. Assume 

41.7 (kcal/mole) 

D l = 0.091 exp-cm 2 /sec. 

RT 

7. An TV-type dopant from a continuous source of concentration C 0 is 
diffused into a P-type semiconductor film containing a single grain 
boundary oriented normal to the surface. If the background dopant level 
in the film is C B , write an expression for the resulting P-N junction 
profile (y vs. x ) after diffusion. 

8 - A l-/*m-thick film of Ni was deposited on a Si wafer. After a 1-h anneal 
at 300 °C, 600 A of Ni 2 Si formed. 

a. Predict the thickness of Ni 2 Si that would form if the Ni-Si couple 
were heated to 350 °C for 2 h. 

o 

b. In forming 600 A of Ni 2 Si, how much Si was consumed? [Note: The 
atomic density of Ni is 9 X 10 22 atoms/cm 3 .] 

c. The lattice parameters of cubic Ni 2 Si and Si are 5.406 A and 5.431 A, 
respectively. Comment on the probable nature of the compound-sub¬ 
strate interface. 

9- The thermal stability of a thin-film superlattice consisting of an alternating 
stack of 100-A-thick layers of epitaxial GaAs and AlAs is of concern. 

a. If chemical homogenization of the layers is limited by the diffusion of 

o 

Ga in GaAs, estimate how long it will take Ga to diffuse 50 A at 
25 °C? 

b. Roughly estimate the temperature required to produce layers of 

composition Ga 0 75 AI 0 . 25 Ga 0 .25^ o 75 As after a 1-h anneal. 

10. For electromigration in A1 stripes assume E e — 0.7 eV and n = 2.5 in 
Eq. 8-23. By what factor is MTF shortened (or extended) at 40 °C by 

a. a change in E e to 0.6 eV? 

b. no change in E e but a temperature increase to 85 °C? 

c. a decrease in stripe thickness at a step from 1.0 to 0.75 finrt 

d. an increase in current from 1 to 1.5 mA? 

11 . a. When there are simultaneous electromigration and diffusional fluxes of 

atoms, show that 

dC ^d 2 C dC 


with v defined by Eq. 8-25. 
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b. For a diffusion couple (C = C 0 for x < 0 and C = 0 for x > 0) 
show that 


C(x,t) = -y 


VX X + vt 

exp — erfc - — r 
D V4 Dt 


4- erfc 


x — vt 
V4 Dt 


is a solution to the equation in part (a) and satisfies the boundary 
conditions. 

c. A homogeneous A1 film stripe is alloyed with a cross stripe of Cu 
creating two interfaces; ( + ) Al-Cu/Al and (-) Al/Al-Cu. Show, 
using the preceding solution, that the concentration profiles that de¬ 
velop at these interfaces obey the relation. 


C + (x,t) vx 

In-;-- = — . 

C_(x, t) D 


12 . The surface accumulation interdiffusion data of Fig. 8-1 la can be fitted to 
the normalized equation C s = 1 - exp - S(t - t 0 ). For 

run 1: S = 7.1 x 10 -1 sec -1 , 

run 2: S = 1.4 x 10” 1 2 3 sec -1 , 

run 3: S = 7.4 x 10 -4 sec -1 , 

run 4: S = 2.6 X 10 -5 sec -1 . 

a. If S is thermally activated, i.e., S = S 0 exp - E s /kT (S 0 = constant), 
make an Arrhenius plot and determine the activation energy for 
diffusion of Ag in Au films. 

b. What diffusion mechanism is suggested by the value of E S 1 

13- a. Compare the time required to grow a 3500 A thick Si0 2 film in dry as 
opposed to wet 0 2 at 1100 °C. Assume the native oxide thickness is 
30 A. 

o 

b. A window in a 3500 A Si0 2 film is opened down to the Si substrate in 
order to grow a gate oxide at 1000 °C for 30 minutes in dry 0 2 . Find 
the resulting thickness of both the gate and surrounding (field) oxide 
films. 
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Chapter 9 




Mechanical Properties 
of Thin Films 


9.1- Introduction 


Interest in mechanical-property effects in thin films has focused on two major 
issues. The primary concern has been with the deleterious effects that stress 
causes in films. This has prompted much research to determine the type, 
magnitude, and origin of stress as well as means of minimizing or controlling 
stresses. A second important concern is related to enhancing the mechanical 
properties of hardness and wear resistance in assorted coating applications. 
The topic of stress in films has historically generated the greatest attention and 
will be our major interest in this chapter. A discussion of the mechanical 
properties of metallurgical and protective coatings is the focus of Chapter 12. 

It is virtually always the case that stresses are present in thin films. What 
must be appreciated is that stresses exist even though films are not externally 
loaded. They directly affect a variety of phenomena, including adhesion, 
generation of crystalline defects, perfection of epitaxial deposits, and formation 
of film surface growths such as hillocks and whiskers. Film stresses that tend 
to increase with thickness are a prime limitation to the growth of very thick 
films because they promote film peeling. In addition, film stresses influence 
band-gap shifts in semiconductors, transition temperatures in superconductors, 
and magnetic anisotropy. Substrate deformation and distortion also necessarily 
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arise from stresses in the overlying films. In most applications, this is not a 
troublesome issue because substrates are usually relatively massive compared 
to films. In integrated circuit technology, however, even slight bowing of 
silicon wafers presents significant problems with regard to maintaining precise 
tolerances in the definition of device features. 

The existence of stresses in thin electrodeposited films has been known since 
1858, when the English chemist Gore noted (Ref. 1): “In electrodeposits 
generally the inner and outer surfaces are in unequal states of cohesive tension 
frequently in so great a degree as to rend the deposit extensively and raise it 
from the cathode in the form of a curved sheet with its concave side to the 
anode.” The concave bending of the cantilevered cathodic electrode implies, 
as we shall see, a tensile stress in the deposit. Despite the passage of years, the 
origins of stress in electrodeposited films are still not completely understood, 
A similar state of affairs exists in both physical and chemical vapor-deposited 
thin films; stresses exist, they can be measured, but their origins are not known 
with certainty. 

There is a great body of information on mechanical effects in bulk materials 
that provides a context for understanding film behavior. At one extreme is the 
elastic regime, rooted in the theory of elasticity, which forms the basis of 
structural mechanics and much engineering design. Here the material elonga¬ 
tions (i.e., strains) are linearly proportional to the applied forces (i.e., stresses). 
Upon unloading, the material snaps back and regains its original shape. At the 
other extreme are the irreversible plastic effects induced at stress levels above 
the limit of the elastic response (i.e., the yield stress). All sorts of mechanical 
forming operations in materials (e.g., rolling, extrusion, drawing) as well as 
failure phenomena (e.g., creep, fatigue, fracture) are manifestations of 
plastic-deformation effects. Plasticity, unlike elasticity, is difficult to model 
mathematically because plastic behavior is nonlinear and strongly dependent on 
the past thermomechanical processing and treatment history of the material. 

In the packaging and attachment of semiconductor chips to circuit modules 
and boards, a new collection of structural-mechanics applications has recently 
emerged (Ref. 2). The involved components are small, thicker than thin films, 
but vastly smaller than conventional engineering structures. Nevertheless, our 
understanding of the mechanical behavior of electronic packaging materials 
used—metals (e.g., Pb-Sn, Al), ceramics (e.g., A1 2 0 3 , Si0 2 glasses), semi¬ 
conductors (e.g., Si, GaAs), and polymers (e.g., epoxies, polyimide)—and the 
structural mechanics of combinations of involved components (e.g., solder 
bumps and joints, chip bonding pads, die supports, encapsulants, etc.) has 
evolved from well-established bulk elastic and plastic phenomena and analyses. 

It comes as no surprise that the varied mechanical properties of thin films 
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also span both the elastic and plastic realms of behavior. For this reason we 
begin with an abbreviated review of relevant topics dealing with these classic 
subjects prior to the consideration of mechanical effects in films. The chapter 
outline follows: 

9.2. Introduction to Elasticity, Plasticity, and Mechanical Behavior 

9.3. Internal Stresses and Their Analyses 

9.4. Stress in Thin Films 

9.5. Relaxation Effects in Stressed Films 

9.6. Adhesion 


9.2. Introduction to Elasticity, Plasticity, and 
Mechanical Behavior 


9.2.1. Elastic Regime 

An appropriate way to start a discussion of mechanical properties is to consider 
what is meant by stress. When forces are applied to the surface of a body, they 
act directly on the surface atoms. The forces are also indirectly transmitted to 
the internal atoms via the network of bonds that are distorted by the internally 
developed stress field. If the plate in Fig. 9-la is stretched by equal and 
opposite axial tensile forces F y then it is both in mechanical equilibrium as 
well as in a state of stress. Since the plate is in static equilibrium, it can be cut 
as shown in Fig. 9-lb, revealing that internal forces must act on the exposed 
surface to keep the isolated section from moving. Regardless of where and at 
what orientation the plate is cut, balancing forces are required to sustain 
equilibrium. These internal forces distributed throughout the plate constitute a 
state of stress. In the example shown, the normal force F divided by the area 
A defines the tensile stress a x . Similarly, normal stresses in the remaining two 
coordinate directions, o y and a z , can be imagined under more complex loading 
conditions. If the force is directed into the surface, a compressive stress arises. 
Convention assigns it a negative sign, in contrast to the positive sign for a 
tensile stress. In addition, mechanical equilibrium on internal surfaces cut at an 
arbitrary angle will generally necessitate forces and stresses resolved in the 
plane itself. These are the so-called shear stresses. The tensile force of Fig. 
9-la produces maximum shear stresses on planes inclined at 45° with respect 
to the plate axis (Fig. 9-la). If the normal tensile stress a x = F / A, then the 
force resolved on these shear planes is F cos 45 0 = V2 /2 F. The area of the 
shear planes is A /cos45° =2/ Vi A. Therefore, the shear stress r = F/2 A 
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c. 



Tyx 

Y xy = Tan a 


Figure 9-1. (a) Tensile force applied to plate, (b) Arbitrary free-body section reveal¬ 

ing spatial distribution of stress through plate. Both tensile and shear stresses exist on 
exposed plane, (c) Distortion in plate due to applied shear stress. 

and is half that of the tensile stress. Shear stresses are extremely important 
because they are essentially responsible for the plastic deformation of crys¬ 
talline materials. Two subscripts are generally required to specify a shear 
stress: the first to denote the plane in which shear occurs, and the second to 
identify the direction of the force in this plane. If, for example, a shear force 
were applied to the top surface of the plate, it distorts into a prism. For all 
forces and moments to balance, a tetrad of shear stresses must act on the 
horizontal as well as vertical faces. It is left as an exercise to show that 
r xy = r yx in equilibrium, and similarly, for T yz and t xz . 

The application of tensile forces extends the plate of Fig. 9-1 by an amount 
A/. This results in a normal strain e x , defined by e x = A/// 0 , where / 0 is the 
original length, similarly, in other directions the normal strains are e v and . 
In the example given, the plate also contracts laterally in both the y and z 
directions in concert with the longitudinal extension in the x direction. 
Therefore, even though there is no stress in the y direction, there is a strain s y 
given by e y = —ve x \ similarly for e z . The quantity v y a measure of this 
lateral contraction, is Poisson’s ratio, and for many materials it has a value of 
about 0.3. Under the action of shear stresses, shear strains ( 7 ) are induced; 
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these are essentially defined by the tangent of the shear distortion angle a in 
Fig. 9-lc. 

In the elastic regime, all strains are small, and Hooke’s law dominates the 
response of the system; i.e., 

o x = Ee x , (9-la) 

where E is Young’s modulus. (Values of E are entered in Tables 9-2 and 12-1 
for a variety of materials of interest.) When a three-dimensional state of stress 
exists, 

S X= {^/E)[a x -t>{(Jy + o z )\ (9-lb) 

(similarly for e y and e z ). This formula simplifies to Eq. 9-la in the absence of 
o y and a z . For shear stresses, Hooke’s law also applies in the form 

T xy = Wxy (9-2) 

(similarly for t xz and r yz ) 9 where fi is the shear modulus. These equations 

only strictly apply to isotropic materials where of the three elastic constants E , 
fi,, and v , only two are independent; they are connected by the relation 
M = E/ 2(1 + v). 

In anisotropic media, such as the single-crystal quartz plate used to monitor 
film thickness during deposition (Chapter 6), the elastic constants reflect the 
noncubic symmetry of the crystal structure. Although there are more elastic 
constants to contend with, Hooke’s law is still valid. In addition to describing 
stress-strain relationships, elasticity theory is concerned with specifying the 
stress and strain distribution throughout the volume of shaped bodies subjected 
to arbitrary loading conditions. In Section 9.3, we employ some of the simpler 
concepts to derive basic formulas used to determine the stress in films. 

9.2.2. Tensile Properties of Thin Metal Films 

Tensile tests are widely used to evaluate both the elastic and plastic response of 
bulk materials. Although direct tensile tests in films are not conducted with 
great frequency today, past measurements are interesting because of the basic 
information they conveyed on the nature of deformation processes in metal 
films. Many of the results and their interpretations can be found in the old but 
still useful review by Hoffman (Ref. 3). Unlike its bulk counterpart, tensile 
testing of thin films is far from a routine task and has all the earmarks of a 
research effort. The extreme delicacy required in the handling of thin films has 
posed a great experimental challenge and stimulus to the ingenuity of investiga¬ 
tors. Detachment of films from substrates and methods for gripping and 
aligning them, applying loads, and measuring the resultant mechanical re- 



408 


Mechanical Properties of Thin Films 


sponse are some of the experimental problems. Loading is commonly achieved 
by electromagnetic force transducers, and strains are typically measured by 
optical methods. The most novel microtensile testing devices have been 
incorporated within electron microscopes, enabling direct observation of de¬ 
fects and recording of diffraction patterns during straining. 

A typical stress-strain curve for gold is shown in Fig. 9-2a. The maximum 
tensile strength, or stress at fracture in this case, is only somewhat higher than 
typical intrinsic or residual stress levels about which we will speak later. The 
strain at fracture is only - 0.8%, which is much more than an order of 
magnitude smaller than that observed in bulk Au. Loading and unloading 
curves of varying slope (or modulus of elasticity) have raised questions of 
whether E differs from the bulk value and whether it is film-thickness-depen- 




Figure 9-2. (a) Stress-strain behavior for a Au film. (From Ref. 4). (b) Load-deflec¬ 

tion behavior for a 0.87-/*m-thick Au cantilever film. (From Ref. 7). 
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dent. On both accounts experimental data indicate no abnormal effects. Above 
the elastic limit there is considerable evidence for plasticity in the form of 
observed dislocation motion, stress relaxation, and creep effects as well as 
regions of localized thinning. Unlike bulk metals, there is no regime of easy 
dislocation glide in films; rather, poly crystalline metal films sometimes contain 
initial dislocation densities of 10 10 to 10 12 cm -2 , which are higher by an order 
of magnitude or so than those found in heavily worked and strain-hardened 
bulk metals. 

The surprisingly high tensile strengths of metal films have been the subject 
of much interest. Typical strengths exceed those for hard-drawn metals by a 
factor of 2 to 10 , and may be as much as 100 times that of annealed bulk 
metals. Polycrystal films are usually stronger than single-crystal films, reflect¬ 
ing the role of grain boundaries as obstacles to dislocation motion. Reported 
values for the maximum tensile stresses or strengths of FCC metal films span a 
range from ji/40 to p/120 with a wide degree of scatter, particularly when 
results from different laboratories are compared. The highest strengths are 
close to those theoretically predicted, assuming deformation occurs by rigid 
lattice displacements. This means that the high dislocation densities in films 
leave few avenues available for either generation or motion of new disloca¬ 
tions. 

Specific microscopic mechanisms for the strength of films are quite detailed. 
Some insight into what is involved, however, can be gained by considering two 
relationships borrowed from theories of mechanical behavior of bulk materials. 
The first simply estimates the shear stress required to cause dislocations to 
effectively bypass obstacles situated a distance I apart and thereby produce 
plastic deformation. Thus, 

r = pb/t 9 (9-3) 

where b is the Burgers vector. The distance / is usually taken as the spacing 
between dislocation pinning points such as precipitates, grain boundaries, or 
other dislocations. If / is imagined to be the film thickness ( d ), then it is clear 
that film strength is predicted to vary inversely with d. The second relationship 
is the Hall-Petch equation, which connects the yield stress to grain size l g . 
Variants of this equation relate ultimate tensile strength a TS to grain size as 
well. Therefore, 

o TS =a a + Kl-'V. (9-4) 

In a modified version, the film thickness replaces the mean grain size. This 
is a justifiable substitution since the two quantities are indirectly related. In Eq. 
9-4, o a is an intrinsic stress level and K is a constant. Again on the basis of 
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this formula, thinner films are expected to be stronger. Although this is 
certainly true in the scaling down from (bulk) micron-thick foils to submicron- 
thick films, it is not certain whether there is a thinness beyond which no 
further strengthening occurs. Experimental data are contradictory with respect 
to this issue. 


9.2.3. Bulge Testing of Films 


Bulge testing is widely used to determine the mechanical properties of thin 
films and membranes. In this test the film-substrate assembly is sealed to the 
end of a hollow cylindrical tube so that it can be pressurized with gas. The 
maximum height of the resulting hemispherical bulge in the film is then 
measured optically with a microscope or interferometer and converted to 
strain. A relationship between the dome height h produced by the differential 
pressure P has been determined to be (Ref. 3) 


P = 


4 dh 


2 

a* + - ■ 


E h 2 


31 - v r l 


(9-5) 


The film thickness and specimen radius are d and r, respectively, and <r 0 is 
the residual stress in the film under a zero pressure differential. To illustrate 
bulge testing and how data are analyzed, we consider a recent study conducted 
on both epitaxial and poly crystalline Si membranes (Ref. 5). These materials 
are candidates for X-ray lithography mask substrates, an application requiring 
high fracture strength and excellent dimensional stability. Freestanding mem¬ 
branes 1 fim thick and 38 mm in diameter, supported by a thicker substrate 
ring of either Si or Si0 2 , were prepared by selective masking and etching 
methods. 

The pressure-deflection response for the polycrystalline membrane is shown 
in Fig. 9-3a. An excellent fit of the data to Eq. 9-5 is evident, enabling values 
of a 0 and E/(l - v) to be determined. Despite the nonlinear membrane 
deflection with pressure, the response is actually elastic and not plastic. 
Repeated pressurization cycles did not result in any appreciable deterioration of 
reproducibility in mechanical response. In Fig. 9-3b the membrane is stressed 
to failure at higher pressure levels. The fracture stress is given by oy = 
P f r 2 14 dh f , where P f and h f are the values at fracture. The following 
results compare the properties of the two Si film materials. 


E ( 10 12 dynes/cm 2 ) ct 0 (10 9 dynes/cm 2 ) a f (10 9 dynes/cm 2 ) 


Epitaxial 
Poly crystal line 


1.8 

2.5 


0.52 

1.7 


1.9 

4.2 
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9.2.4. Other Testing Methods 

Recent years have witnessed the development of new techniques to measure the 
mechanical properties of films under the application of minute loads. Simulta¬ 
neously, very small displacements are detected so that a continuous 
“stress-strain”-like curve is obtained. These techniques are based on the use 
of the Nanoindenter, a load-controlled submicron indentation instrument that is 
commercially available (Ref. 6). Its chief application has been indentation 
hardness testing, a subject more fully treated in Chapter 12. In addition to 
hardness, indentation tests have been used to indirectly measure a wide variety 
of mechanical properties in bulk materials, such as flow stress, creep resis¬ 
tance, stress relaxation, fracture, toughness, elastic modulus, and fatigue 
behavior. 

Similar future tests on films and coatings with the Nanoindenter are certain. 
In this instrument the typical resolution of the displacment-sensing system is 
2 A, and that of the loading system is 0.3 jxN. Due to the very small volume 
(— 1 urn 3 ) sampled, the technique may be regarded as a mechanical properties 
microprobe by analogy to chemical microprobes (e.g., AES, SIMS). In one 
application, the Nanoindenter loading mechanism was used to load and mea¬ 
sure the deflection of cantilever microbeams (Ref. 7). The latter were fabri¬ 
cated from Si wafers by employing micromachining techniques (Section 
14.1.2). These involve standard photolithographic and etching processes bor¬ 
rowed from microelectronics technology to generate a variety of geometric 
shapes. Dimensions are larger than those employed in VLSI technology, but 
significantly smaller than what can be machined or fabricated by traditional 
methods. Thus freestanding microbeams - 1 fim thick, 20 fxm wide, and 20 
/xm long have been fabricated and tested by employing the experimental 
arrangement depicted schematically in Fig. 9-4. By evaporating or growing 
films on the Si substrate, then etching the latter away, we can extend the 
technique to different film materials. A typical beam load-deflection curve for 
a Au microbeam is shown in Fig. 9-2b. During loading, the linear elastic, as 
well as nonlinear, strain-hardening plastic regimes are observed. Formulas for 
the elastic deflection 

8 = 4F(1 - v^P/wd^E, (9-6) 

and yield stress 

o y = 6lF y /wd 2 (9-7) 

of the beam enable E and o y to be extracted from the data. Here /, w, and d 

are the beam length, width, and thickness, and F y is the load marking the 

deviation from linearity in the loading response. 



9.3. Internal Stresses and Their Analyses 


413 



Figure 9-4. Schematic of Nanoindenter loading mechanism and a cantilever mi¬ 
crobeam. (From Refs. 6 and 7, with permission from the Materials Research Society). 

Further information on microbeam deflection and related submicron indenta¬ 
tion testing techniques for films and coatings is provided in the recent review 
by Nix (Ref. 7). This reference is highly recommended for its treatment of 
misfit dislocations in epitaxial films (a subject already discussed in Chapter 7), 
as well as coverage of elastic and plastic phenomena including internal stress, 
strength, and relaxation effects in films. It is to these topics that we now turn 
our attention. 


9.3. Internal Stresses and Their Analysis 
9.3.1. Internal Stress 

Implicit in the discussion to this point is that the stresses and the effects they 
produce are the result of externally applied forces. After the load is removed, 
the stresses are expected to vanish. On the other hand, thin films are stressed 
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even without the application of external loading and are said to possess internal 
or residual stresses. The origin and nature of these internal residual stresses are 
the sources of many mechanical effects in films and a primary concern of this 
chapter. 

Residual stresses are, of course, not restricted to composite film-substrate 
structures, but occur universally in all classes of homogeneous materials under 
special circumstances. A state of nonuniform plastic deformation is required, 
and this frequently occurs during mechanical or thermal processing. For 
example, when a metal strip is reduced slightly by rolling between cylindrical 
rolls, the surface fibers are extended more than the interior bulk. The latter 
restrains the fiber extension and places the surface in compression while the 
interior is stressed in tension. This residual stress distribution, is locked into 
the metal, but can be released like a jack-in-the-box. Machining a thin surface 
layer from the rolled metal will upset the mechanical equilibrium and cause the 
remaining material to bow. Residual stresses arise in casting, welds, machined 
and ground materials, and heat-treated glass. The presence of residual stresses 
is usually undesirable, but there are cases where they are beneficial. Tempered 
glass and shot-peened metal surfaces rely on residual compressive stresses to 
counteract harmful tensile stresses applied in service. 

A model for the generation of internal stress during the deposition of films is 
illustrated in Fig. 9-5. Regardless of the stress distribution that prevails, 
maintenance of mechanical equilibrium requires that the net force ( F ) and 



FILM UNDER RESIDUAL FILM UNDER RESIDUAL 

TENSILE STRESS COMPRESSIVE STRESS 

Figure 9-5. Sequence of events leading to (a) residual tensile stress in film; (b) 
residual compressive stress in film. 
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(b) 

Figure 9-6. Stresses in silver lithium thin films: (a) tensile film failures during 
deposition; (b) compressive film failures during aging in Ar. (Courtesy of R. E. 
Cuthrell, Sandia National Lab). 
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bending moment (M) vanish on the film-substrate cross section. Thus 

F — J a dA = 0, (9-8a) 

M = J ay dA =0, (9-8b) 

where A is the sectional area and y is the moment lever arm. Intuitive use 
will be made of these basic equations, and they are applied analytically in 
deriving the Stoney formula in the next section. In the first type of behavior 
shown in Fig. 9-5a, the growing film initially shrinks relative to the substrate. 
Surface tension forces are one reason why this might happen; the misfit 
accompanying epitaxial growth is another. Compatibility, however, requires 
that both the film and substrate have the same length. Therefore, the film is 
constrained and stretches, and the substrate accordingly contracts. The tensile 
forces developed in the film are balanced by the compressive forces in the 
substrate. However, the combination is still not in mechanical equilibrium 
because of the uncompensated end moments. If the film-substrate pair is not 
restrained from moving, it will elastically bend as shown, to counteract the 
unbalanced moments. Thus, films containing internal tensile stresses bend the 
substrate concavely upward. In an entirely similar fashion, compressive stresses 
develop in films that tend to initially expand relative to the substrate (Fig. 
9-5b). Internal compressive film stresses, therefore, bend the substrate con- 
vexly outward. These results are perfectly general regardless of the specific 
mechanisms that cause films to stretch or shrink relative to substrates. Some¬ 
times the tensile stresses are sufficiently large to cause film fracture. Similarly, 
excessively high compressive stresses can cause film wrinkling and local loss 
of adhesion to the substrate. Examples of both effects are shown in Fig. 9-6. A 
discussion of the causes of internal stress will be deferred until Section 9.4. 
We now turn our attention to a quantitative calculation of film stress as a 
function of substrate bending. 

9.3.2. The Stoney Formula 

The formulas that have been used in virtually all experimental determinations 
of film stress are variants of an equation first given by Stoney in 1909 (Ref. 8). 
This equation can be derived with reference to Fig. 9-7, which shows a 
composite film-substrate combination of width w. The film thickness and 
Young’s modulus are d f and E f , respectively, and the corresponding sub¬ 
strate values are d s and E s . Due to lattice misfit, differential thermal 
expansion, film growth effects, etc., mismatch forces arise at the film-sub- 
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Figure 9-7. Stress analysis of film-substrate combination: (a) composite structure; 
(b) free-body diagrams of film and substrate with indicated interfacial forces and end 
moments; (c) elastic bending of beam under applied end moment. 

strate interface. In the free-body diagrams of Fig. 9-7b each set of interfacial 
forces can be replaced by the statically equivalent combination of a force and 
moment: F f and M f in the film, F s and M s in the substrate, where F f = F s . 
Force F f can be imagined to act uniformly over the film cross section (d f w) 
giving rise to the film stress. The moments are responsible for the bowing of 
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the film-substrate composite. Equation 9-8b requires equality of the clockwise 
and counterclockwise moments, a condition expressed by 

{(d f +d s )/2)F f =M f +M s . (9-9) 


Consider now an isolated beam bent by moment M, as indicated in Fig. 
9-7c. In this case, the deformation is assumed to consist entirely of the 
extension or contraction of longitudinal beam fibers by an amount proportional 
to their distance from the central or neutral axis, which remains unstrained in 
the process. The stress distribution reflects this by varying linearly across the 
section from maximum tension ( + a m ) to maximum compression (~o m ) at the 
outer beam fibers. In terms of the beam radius of curvature /?, and angle 6 
subtended, Hooke’s law yields 


(R ± df2)0 - R6\ Ed 

~ E * ~R0 | "" ± Jr 


(9-10) 


Corresponding to this stress distribution is the bending moment across the 
beam section: 


M = 




Ed 3 w 
12 R 


By extension of this result, we have 

M f = E f d}w/\2R and M s = E s d)w/\2R. 


(9-11) 


Lastly, in order to account for biaxial stress conditions, it is necessary to 
replace E f by E f /( 1 - v f ), and similarly for E s . Substitution of these terms 
in Eq. 9-9 yields 


2 ) Ff 12 R 



(9-12) 


Since d s is normally much larger than d fy the film stress a f is, to a good 
approximation, given by 


= J/ _ [ E s d) 

° f d f w 6R (1 - v s )d/ 


(9-13) 


Equation 9-13 is the Stoney formula. Values of o f are determined through 
measurement of R. The reader should be wary of oy values in the literature, 
since the (1 - v s ) correction is frequently omitted. 
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9.3.3. Thermal Stress 


Thermal effects provide important contributions to film stress (Ref. 9). Films 
and coatings prepared at elevated temperatures and then cooled to room 
temperature will be thermally stressed, as will films that are thermally cycled 
or cooled from the ambient to cryogenic temperatures. To see what the 
magnitude of the thermal stress is, consider a rod of length / 0 and modulus E , 
clamped at both ends. If its temperature is reduced from T 0 to T , it would 
tend to shrink in length by an amount equal to ct(T - T 0 )l 0 , where a is the 
coefficient of linear expansion (a = Al/1 0 AT(K~ 1 )). But the rod is con¬ 
strained and is, therefore, effectively elongated in tension. The tensile strain is 
simply e = a(T - T 0 )I 0 /l 0 - a(T - T 0 ), and the corresponding thermal 
stress, by Hooke’s law, is 

a = Ea(T — T 0 ) . (9-14) 

Consider now the film-substrate combination of Fig. 9-7 subjected to a 
temperature differential AT. The film and substrate strains are, respectively, 

e f = af AT + Fj-( 1 - v f )/E f d f w, (9-15a) 

= a s AT - F f { 1 - v s )/E s d s w. (9-15b) 


But strain compatibility requires that e f = e s \ therefore, the thermal mismatch 
force is 


w(a s - ccj) AT 

V= ((1 - *,)ld f E f ) + ((1 - v s )!d s E s ) ' 


(9-16) 


If d s E s /{ 1 - v s ) > d f E f j{\ - u f ), the thermal stress in the film is 

a f {T) = Fj/djW = (a s - a,) ATE f /(\ - (9-17) 

Note that the signs are consistent with dimensional changes in the film-sub¬ 
strate. Films prepared at elevated temperatures will be residually compressed 
when measured at room temperature (AT < 0) if ot s > a f . In this case the 
substrate shrinks more than the film. Overall, the system must contract a fixed 
amount, but a compromise is struck; the substrate is not allowed to contract 
fully and is, therefore, placed in tension, and the film hindered from shrinking, 
is consequently forced into compression. An example of this occurs in TiC 
coatings on steel. At the CVD deposition temperature of 1000 °C, it is 
assumed the coating and substrate are unstressed. For the values a slecl = 11 x 
10 “ 6 K \ a TiC = 8 X 10 ' 6 K" 1 , E TiC = 4.5 X 10 12 dynes/cm 2 (450 
kN/mm 2 ), and v f = 0.19, the compressive stress calculated for TiC, using Eq. 
9-17, is 1.67 x 10 !0 dynes/cm 2 (1.67 kN/mm 2 ) at 0 °C. 
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Lastly, by equating Eqs. 9-13 and 9-17, we obtain 


1 

7 ? 


6*/(l ~ »,) 

Es( 1 - y f ) 


dl 


(a s - ctf) AT. 


(9-18) 


This modification of Stoney’s equation represents the extent of bowing when 
differential thermal expansion effects cause the stress. In order to generally 
convert the measured deflection into film stress, we note that the curvature is 
related to the second derivative of the beam displacement; i.e., l/R = 
d 2 y{x)/dx 2 . After integration, y(x) = x 2 /2R. For a cantilever of length /, 
if the free-end displacement is 5, then 5 = l 2 /2R y and Stoney’s formula yields 

o f =bE s d 2 s /U 2 {\-v s )d f . (9-19) 


9,4. Stress in Thin Films 


9.4.1. Stress Measurement Techniques 

Two film-substrate configurations have been primarily used to determine 
internal stresses in films. In the first the substrate is fashioned into the shape of 
a cantilever beam. The film is deposited on one surface, and the deflection of 
the free end of the bent beam is then determined (Fig. 9-8a). A common 
measure of the sensitivity of the measurement is the smallest detectable force 


a 



b 


SUBSTRATE 



DEPOSITED FILM 
=m-0PTICAI FLAT 


i 


i 

I 



Figure 9-8. Schematic diagrams of film stress measurement techniques. (From Ref. 
10): (a) bending of cantilever beam; (b) bowing of circular plate. 
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per unit width 5, defined by the product of the film stress and thickness. By 
Eq. 9-19, 

S=o f d / =bE s d]/U 2 {\-v s ). (9-20) 

This definition circumvents difficulties in the magnitude of cr f in very thin 
films since d f is no longer in the denominator; S is simply proportional to the 
deflection. To obtain oy, d f must be independently known. For very thin films, 
measurement sensitivity is an issue of concern. The minimum value of S 
detected is frequently taken as a measure of the sensitivity. Typical values of S 
for the different experimental techniques are given in Tabic 9-1. 

Measurements of stress are frequently made in real time during film 
formation and growth. Sensitive electromechanical or magnetic restoration of 
the null, or undeflected position of the beam, in combination with measure¬ 
ment of the restoring force enables continuous monitoring. Null methods have 
a couple of advantages in film stress determinations relative to techniques in 
which beam (or plate) deflections are measured. One is the lack of stress 
relaxation in the film because the substrate is effectively restrained from 
deflecting. The second is that the frequently unknown value of Young's 
modulus is, surprisingly, not required to evaluate the stress in the film. Since 
there is no deflection, Stoney's formula is inappropriate. Rather, the restoring 
force establishes a moment, and from an equation of the type M ~ o m d 2 w/6 
(Eq. 9-11) the stress can be evaluated. 

Measurements for a number of metals evaporated onto glass cantilever 
substrates at room temperature are shown in Fig. 9-9. In addition to ay, which 
can be determined at any value of d f , the slope at any point of the S-d f plot 


Table 9-1. Sensitivity of Film Stress Measurement Techniques 


Method 

Substrate 

Configuration 

Detectable Force 
per Unit Width (dynes/cm) 

Optical 

B 

800 

Optical 

C 

250 

Capacitance 

C 

500 

Magnetic restoration 

C 

250 

Electromagnetic restoration 

C 

150 

Mechanical 

C 

1 

Electromechanical 

C 

1 

Interferometric 

C 

0.5 

Interferometric 

P 

15 

X-ray 


500 


B = beam supported on both ends; C = cantilever beam; P = circular plate. 
From Refs, 3 and 10. 



STRESS * THICKNESS (dynes/cm) x I0~ 4 




Figure 9-9. Internal stress values in a number of evaporated metal thin films: (a) S = o f d f vs. film thickness; (b) o f vs. film 
thickness. (From Ref. 11, reprinted by permission of the Electrochemical Society). 
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yields a quantity known as the incremental or instantaneous stress oy(z ). The 
latter is useful in characterizing dynamic changes in film stress. Thus, 

rdf dS 

S= a f (z) dz or a f (z) = —, (9-21) 

J 0 dz 

where oy(z) is the stress present in a layer thickness dz, at a distance z from 
the film-substrate interface. The reader should be alert to the fact that S, oy, 
and Oj{z) have all been used to report stress values in the literature. Unless 
otherwise noted, the single term oy will be used for the remainder of the 
chapter. 

The second type of substrate configuration frequently employed is the 
circular plate. In this case, Eq. 9-19 is assumed to describe the resultant stress, 
where / is now the plate radius and 6 represents the center deflection. Not only 
is the plate an important test geometry, it has obvious applications in optical 
components. As a practical example, consider a circular glass plate window 
with a diameter-to-thickness ratio of 16:1 having the elastic properties E = 
6.37 x 10 11 dynes/cm 2 and v - 0.25. If one face is coated with a 1000-A-thick 
single-layer antireflection film of MgF 2 , a stress of 3.33 x 10 9 dynes/cm 2 
develops after deposition. The depression of the center relative to the circum¬ 
ference is calculated to be 7.5 x 10“ 6 cm. For a wavelength X = 5000 A the 
extent of bowing corresponds to 0.15X. This change from a planar surface to a 
paraboloid of revolution is significant in the case of high-precision optical 
surfaces where the tolerance may be taken to be 0.05X. In the case of thin 
lenses, stress distortions can also change critical spacings in multi-element 
optical systems. A typical arrangement for stress measurement of plates is 
shown in Fig. 9-8b, where the change in the optical fringe pattern between the 
film substrate and an optical flat is used to measure the deformation. Interfer¬ 
ence patterns illustrating different biaxial stress states are shown in Fig. 9-10. 
Alternatively, a calibrated optical microscope can be used to measure the 
extent of bowing. 

There are a number of issues relevant to the experimental determination of 
accurate values of film stress. Some deal with the validity of elastic theory 
itself in treating the film-substrate composite properties, geometries, and 
deflections involved. In general, if deflections are small compared with the 
substrate beam or plate thickness, the simple theory suffices. Substrate elastic 
constant values should be chosen with care and checked to determine whether 
they are isotropic. Special care must be exercised when determining the stress 
in epitaxial films because elastic constants of both films and substrates are 
anisotropic, complicating the stress analysis. 

A number of useful methods for determining stress in films, including 
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Figure 9-10. Interference fringe patterns in biaxially stressed, sputtered Mo films. 
(From Ref. 12): (a) balanced biaxial tension (or compression) (o x = a y )\ (b) unbal¬ 
anced biaxial tension (or compression ) (o x ± o y )\ (c) one component tensile, one 
component compressive. 
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epitaxial films, are based on X-ray diffraction methods. As an example, 
consider a polycrystalline film containing an isotropic biaxial tensile stress 
distribution in the xy plane (a z = 0). The film contracts in the z direction by 
an amount (see Eq. 9-lb, also application to epitaxial films, p. 349) 

= (~v/E)(o x + o y ) = -v{e x + e y ). (9-22) 

By measuring the lattice spacing in the stressed film a 0 as well as unstressed 
bulk lattice (a 0 ) with X-rays, we can determine s z directly; i.e., e z = 
-(a {] - a 0 )/a {) . Since a x = cr y = cr f . 



The accuracy of the X-ray technique is considerably extended with high-preci¬ 
sion lattice parameter determinations. Precise determination of a 0 and a 0 is 
complicated by line broadening due to small grain size, dislocations, twins, 
stacking faults, and nonuniform microstrains. 


9.4.2. Measured Intrinsic Stress Behavior 

9.4.2.I. Evaporated Films . Despite the apparent simplicity of the experi¬ 
mental techniques and corresponding defining stress equations, the measured 
values of the intrinsic contribution to oy display bewildering variations as a 
function of deposition variables, nature of film-substrate combination, and 
film thickness. Some of the variety is evident in the measured stress values for 
1000-A-thick metal and nonmetal films (evaporated on room-temperature glass 
or silica substrates), which are tabulated in Table 9-2. These values for 
evaporated films should be considered representative rather than precise. Even 
though data published by different investigators employing similar and differ¬ 
ent measurement techniques are frequently inconsistent, the following trends 
can be discerned from published results in this field: 

a. In metals the film stress is invariably tensile with a magnitude ranging from 
10 8 to 10 10 dynes/cm 2 . 

b. There is no apparent strong dependence of stress on the nature of the 
substrate. 

c. In dielectric films, compressive and tensile stresses arise. 

d. The magnitude of the stress in nonmetallic films is frequently small. 

A simple way to rationalize the difference in behavior between metals and 
nonmetals is to note that metals are strong in tension, but offer little resistance 
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Table 9-2. Intrinsic Stress in Evaporated Films 


Metals* 

o f (10 9 dynes/cm 2 ) 

E (10 12 dynes/cm 2 ) 

Ag 

0.2 

0.76 

A1 

-0.74 

0.69 

Au 

2.6 

0.80 

Co 

8.4 

2.06 

Cu 

0.6 

1.17 

Cr 

8.5 

2.48 

Fe 

11.0 

2.0 

In 

- 0 

0.11 

Mn 

9.8 

1.58 

Mo 

10.8 

3.24 

Pd 

6 

1.12 

Ti 

- 0 

1.15 

Zr 

7 

0.94 

Nonmetals** 

Of (10 9 dynes/cm 2 ) 

E (10 12 dynes/cm 2 ) 

C (graphite) 

-4 

0.4 

Ge 

2.3 

1.58 

Si 

3 

2.0 

Te 

0.6 


ZnS 

-1.9 

0.54 

MgF 2 

3 to 7 

1.17 

PbF 2 

-0.2 


Cryolite 

0.2 


CaF 2 

0.2 


CeF 3 

2.2 


SiO 

0.1 


PbCl 2 

0.8 


CdTe 

-1.4 


Tl (I, Br) 

-0.07 


T1C1 3 

-0.3 


ThOF 2 

1.5 



Note: 1 dyne/cm 2 - 10“ 7 MN/m 2 — 10“ 1 Pa. 
*From Refs. 3 and 11. 

**From Refs. 3, 13, and 14. 


to compression, whereas insulators and semiconductors are strong in compres¬ 
sion and weak in tension. Therefore, metals prefer to be in tension, and 
nonmetals are best deposited in a state of compression. 


9.4.2.2. Thickness Dependence . The data of Fig. 9-9 provide a valuable 
means for evaluating distinctions and similarities among different metals 
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because of the common in situ measurement technique, the high vacuum ( 10 -6 
to 10" 7 torr) maintained during deposition and measurement, the similar film 
deposition rates, and the absence of a thermal stress contribution. With few 
exceptions, the film stress is always tensile. Hard refractory metals and metals 
with high melting points generally tend to exhibit higher residual stresses than 
softer, more easily melted metals. Appreciable film stress arises after only 100 
A or so of deposition, after which large stresses continue to develop up to a 
thickness of roughly 600 A. This thickness increment range is, not surpris¬ 
ingly, coincident with the typical coalescence and channel stages of growth 
leading to continuous film formation. There is thus good reason for the use of 
the term growth stress to denote intrinsic stress ( 07 ). With further film growth 
the stress does not change appreciably. 

9.4.2.3. Temperature Effects . When film deposition occurs at tempera¬ 
tures different from that at which the stress is measured, the previously 
considered differential thermal expansion contribution (Eq. 9-17) superimposes 
on the growth stress; i.e., 

* f =* f (T)+o n (9-24) 

Precise determinations of Oj require subtraction of the o f {T) correction. In 
addition, heated substrates alter the intrinsic stresses largely by promoting 
defect annealing and the processes of recrystallization or even grain growth if 
the temperature is high enough. The resultant softening relaxes the growth 
stresses, which fall rapidly with temperature. Diffusion of impurities into and 
out of the film is also accelerated, and this can give rise to substantial stress 
change. The total film stress may then show a minimum or even reverse in 
sign. Both kinds of behavior have been observed in practice. One implication 
of stress reversal is the possibility of depositing films with low or even 
near-zero stress levels. This is frequently a desirable feature, e.g., for mag¬ 
netic thin films. For the required critical properties to be achieved, film 
composition, deposition rates, and substrate temperature must be optimally 
adjusted. 

9.4.2.4. Sputtered Films. Unlike evaporated films, generalizations with 
respect to stress are difficult to make for sputtered metal films because of the 
complexity of the plasma environment and the effect of the working gas. 
However, at low substrate temperatures, compressive film stresses are often 
observed. The fact that the extent of compression varies directly with the 
amount of trapped gas has pointed to the latter as the source of stress. But this 
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is too simplistic a view. Sputtered films display a rich variety of effects, 
including tensile-to-compressive stress transitions as a function of process 
variables. For example, in rf-diode-sputtered tungsten films a stress reversal 
from tension to compression was achieved in no less than three ways (Ref. 15): 

a. By raising the power level about 30 W at zero substrate bias 

b. By reversing the dc bias from positive to negative 

c. By reducing the argon pressure 

Oxygen incorporation in the film favored tension, whereas argon was appar¬ 
ently responsible for the observed compression. 

The results of extensive studies by Hoffman and Thornton (Ref. 16) on 
magnetron-sputtered metal films are particularly instructive since the internal 
stress correlates directly with microstructural features and physical properties. 
Magnetron sputtering sources have made it possible to deposit films over a 
wide range of pressures and deposition rates in the absence of plasma bom¬ 
bardment and substrate heating. It was found that two distinct regimes, 


(m Torr) 




ATOMIC MASS 


Figure 9-11. (a) Biaxial internal stresses as a function of Ar pressure for Cr, Mo, 

Ta, and Pt films sputtered onto glass substrates: • parallel and ■ perpendicular to long 
axis of planar cathode. (From Ref. 16). (b) Ar transition pressure vs. atomic mass of 
sputtered metals for tensile to compressive stress reversal. (From Ref. 16). 
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separated by a relatively sharp boundary, exist where the change in film 
properties is almost discontinuous. The transition boundary can be thought of 
as a multidimensional space of the materials and processing variables involved. 
On one side of the boundary, the films contain compressive intrinsic stresses 
and entrapped gases, but exhibit near-bulklike values of electrical resistivity 
and optical reflectance. This side of the boundary occurs at low sputtering 
pressures, with light sputtering gases, high-mass targets, and low deposition 
rates. On the other hand, elevated sputtering pressures, more massive sputter¬ 
ing gases, light target metals, and oblique incidence of the depositing flux 
favor the generation of films possessing tensile stresses containing lesser 
amounts of entrapped gases. Internal stress as a function of the Ar pressure is 
shown in Fig. 9-1 la for planar magnetron-deposited Cr, Mo, Ta, and Pt. The 
pressure at which the stress reversal occurs is plotted in Fig. 9-1 lb versus the 
atomic mass of the metal. 

Comparison with the zone structure of sputtered films introduced in Chapter 
5 reveals that elevated working pressures are conducive to development of 
columnar grains with intercrystalline voids (zone 1). Such a structure exhibits 
high resistivity, low optical reflectivity, and tensile stresses. At lower pres¬ 
sures the development of the zone 1 structure is suppressed. Energetic particle 
bombardment, mainly by sputtered atoms, apparently induces compressive film 
stress by an atomic peening mechanism. 

9.4.3. Some Theories of Intrinsic Stress 

Over the years, many investigators have sought universal explanations for the 
origin of the constrained shrinkage that is responsible for the intrinsic stress. 
Buckel (Ref. 17) classified the conditions and processes conducive to internal 
stress generation into the following categories, some of which have already 
been discussed: 

1. Differences in the expansion coefficients of film and substrate 

2. Incorporation of atoms (e.g., residual gases) or chemical reactions 

3. Differences in the lattice spacing of monocrystalline substrates and the film 
during epitaxial growth 

4. Variation of the interatomic spacing with the crystal size 

5. Recrystallization processes 

6. Microscopic voids and special arrangements of dislocations 

7. Phase transformations 

One of the mechanisms that explains the large intrinsic tensile stresses 
observed in metal films is related to item 5. The model by Klokholm and Berry 
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(Ref. 11) suggests that the stress arises from the annealing and shrinkage of 
disordered material buried behind the advancing surface of the growing film. 
The magnitude of the stress reflects the amount of disorder present on the 
surface layer before it is covered by successive condensing layers. If the film is 
assumed to grow at a steady-state rate of G monolayers/sec, the atoms will on 
average remain on the surface for a time G ~'. In this time interval, thermally 
activated atom movements occur to improve the crystalline order (recrystalliza¬ 
tion) of the film surface. These processes occur at a rate r described by an 
Arrhenius behavior, 

r = v 0 e- E ’/ RT s, (9-25) 

where y 0 is a vibrational frequency factor, E r is an appropriate activation 
energy, and T s is the substrate temperature. On this basis it is apparent that 
high-growth stresses correspond to the condition G>r, low-growth stresses 
to the reverse case. At the transition between these two stress regimes, G = r 
and E r /RT S = 32, if G is 1 sec" 1 and v 0 is taken to be 10' 4 sec 1 . 
Experimental data in metal films generally show a steep decline in stress when 
T m /T s = 4.5, where T M is the melting point. Therefore, E r = 32 RT M /4.5 
= 14.2 T m . In Chapter 8 it was shown that for FCC metals the self-transport 
activation energies are proportional to T M as 34 T M , 25T M , 17.8 T M , and 
137^ for lattice, dislocation, grain-boundary, and surface diffusion mecha¬ 
nisms, respectively. The apparent conclusion is that either surface or grain- 
boundary diffusion of vacancies governs the temperature dependence of film 
growth stresses by removing the structural disorder at the surface of film 
crystallites. 

Hoffman (Ref. 18) has addressed stress development due to coalescence of 
isolated crystallites when forming a grain boundary. Through deposition 
neighboring crystallites enlarge until a small gap exists between them. The 
interatomic forces acting across this gap cause a constrained relaxation of the 
top layer of each surface as the grain boundary forms. The relaxation is 
constrained because the crystallites adhere to the substrate, and the result of the 
deformation is manifested macroscopically as observed stress. 

We can assume an energy of interaction between crystallites shown in Fig. 
9-12 in much the same fashion as between atoms (Fig. l-8b). At the equilib¬ 
rium distance a , two surfaces of energy y s are eliminated and replaced by a 
grain boundary of energy y b . For large-angle grain boundaries y gh «= 
(l/3)y 5 , so that the energy difference 2y s - y gh ~ (5/3)y s represents the 
depth of the potential at a . As the film grows, atoms are imagined to 
individually occupy positions ranging from r (a hard-core radius) to 2 a (the 
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r a 2a 


ATOMIC SEPARATION 

Figure 9-12. Grain-boundary potential. (Reprinted with permission from Elsevier 
Sequoia, S.A., from R. W. Hoffman, Thin Solid Films 34, 185, 1976). 


nearest-neighbor separation) with equal probability. Between these positions 
the system energy is lowered. If an atom occupies a place between r and a, it 
would expand the film in an effort to settle in the most favored position— a. 

Similarly, atoms deposited between a and 2 a cause a film contraction. 
Because the potential is asymmetric, contraction relative to the substrate 
dominates leading to tensile film stresses. An estimate of the magnitude of the 
stress is 


EAP 

1 - v * d c ’ 


(9-26) 


where d c is the mean crystallite diameter and P is the packing density of the 
film. The quantity A is the constrained relaxation length and can be calculated 
from the interaction potential between atoms. When divided by d c , A /d c 
represents an “effective” strain. In Cr films, for example, where E/(\ - v) 
= 3.89 x 10 12 , d c = 130 A y A = 0.89 A, and P = 0.96, the film stress is 
calculated to be 2.56 X 10 10 dynes/cm 2 . Employing this approach, Pulker and 
Maser (Ref. 19) have calculated values of the tensile stress in MgF 2 and 
compressive stress in ZnS in good agreement with measured values. 

A truly quantitative theory for film stress has yet to be developed, and it is 
doubtful that one will emerge that is valid for different film materials and 
methods of deposition. Uncertain atomic compositions, structural arrangements 
and interactions in crystallites and at the film-substrate interface are not easily 
amenable to a description in terms of macroscopic stress-strain concepts. 
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9 . 5 . Relaxation Effects in Stressed Films 


Until now, we have only considered stresses arising during film formation 
processes. During subsequent use, the grown-in elastic-plastic state of stress 
in the film may remain relatively unchanged with time. However, when films 
are exposed to elevated temperatures or undergo relatively large temperature 
excursions, they frequently display a number of interesting time-dependent 
deformation processes characterized by the thermally activated motion of 
atoms and defects. As a result, local changes in the film topography can occur 
and stress levels may be reduced. In this section we explore some of these 
phenomena that are exemplified in materials ranging from lead alloy films 
employed in superconducting Josephson junction devices to thermally grown 
Si0 2 films in integrated circuits. 


9.5.1. Stress Relaxation in Thermally Grown Si0 2 

As noted previously (page 395), a volume change of some 220% occurs when 
Si is converted into Si0 2 . This expansion is constrained by the adhesion in the 
plane of the Si wafer surface. Large intrinsic compressive stresses are, 
therefore, expected to develop in Si0 2 films in the absence of any stress 
relaxation. A value of 3 X 10 11 dynes/cm 2 has, in fact, been estimated (Ref. 
20), but such a stress level would cause mechanical fracture of both the Si and 
Si0 2 . Not only does oxidation of Si occur without catastrophic failure, but 
virtually no intrinsic stress is measured in Si0 2 grown above 1000 °C. To 
explain the paradoxical lack of stress, let us consider the viscous flow model 
depicted in Fig. 9-13. For simplicity, only uniaxial compressive stresses are 
assumed to act on a slab of Si0 2 , which is free to flow vertically. The Si0 2 
film is modeled as a viscoelastic solid whose overall mechanical response 
reflects that of a series combination of an elastic spring and a viscous dashpot 
(Fig. 13b). Under loading, the spring instantaneously deforms elastically, 
whereas the dashpot strains in a time-dependent viscous fashion. If and e 2 
represent the strains in the spring and dashpot, respectively, then the total 
strain is 

£ t = e i + e 2 • (9-27) 

The same compressive stress a x acts on both the spring and dashpot so that 
= o x /E and e 2 = a x/V’ where e 2 = de 2 /dt , and 17 is the coefficient of 
viscosity. Here we recognize that the rate of deformation of glassy materials, 
including Si0 2 , is directly proportional to stress. Assuming e T is constant. 
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Figure 9-13. (a) Viscous flow model of stress relaxation in Si0 2 films. (From Ref. 

20); (b) spring-dashpot model for stress relaxation; (c) spring-dashpot model for strain 
relaxation. 

e ! = - e 2 or (1 /E)da x /dt = - o x /rf . Upon integration, we obtain 

0 x = a 0 e- E,/ \ (9-28) 

The initial stress in the film, a 0 , therefore relaxes by decaying exponentially 
with time. With E - 6.6 X 10 11 dynes/cm 2 and t) = 2.8 x 10 12 dynes- 
sec/cm 2 at 1100 °C, the time it takes for the initial stress to decay to a 0 je is a 
mere 4.3 sec. Oxides grown at this temperature are, therefore, expected to be 
unstressed. Since tj is thermally activated, oxides grown at lower temperatures 
will generally possess intrinsic stress. The lack of viscous flow in a time 
comparable to that of oxide growth limits stress relief in such a case. 
Typically, intrinsic compressive stresses of 7 x 10 9 dynes/cm 2 have been 
measured in such cases. 

9.5.2. Strain Relaxation in Films 

It is worthwhile to note the distinction between stress and strain relaxation. 
Stress relaxation in the Si0 2 films just described occurred at a constant total 
strain or extension in much the same way that tightened bolts lose their tension 
with time. Strain relaxation, on the other hand, is generally caused by a 
constant load or stress and results in an irreversible time-dependent stretching 
(or contraction) of the material. The latter can be modeled by a spring and 
dashpot connected in parallel combination (Fig. 13c). Under the application of 
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a tensile stress the spring wishes to instantaneously extend, but is restrained 
from doing so by the viscous response of the dashpot. It is left as an exercise 
for the reader to show that the strain relaxation in this case has a time 
dependence given by 



In actual materials complex admixtures of stress and strain relaxation effects 
may occur simultaneously. 

Film strains can be relaxed by several possible deformation or strain 
relaxation mechanisms. The rate of relaxation for each mechanism is generally 
strongly dependent on the film stress and temperature, and the operative or 
dominant mechanism is the one that relaxes strain the fastest. A useful way to 
represent the operative regime for a given deformation mechanism is through 
the use of a map first developed for bulk materials (Ref. 21), and then 
extended to thin films by Murakami et aL (Ref. 22). Such a map for a 
Pb-In-Au film is shown in Fig. 9-14 where the following four strain relax¬ 
ation mechanisms are taken into account: 

L Defectless Flow . When the stresses are very high, slip planes can be 
rigidly displaced over neighboring planes. The theoretical shear stress of 
magnitude — fx/20 is required for such flow. Stresses in excess of this value 
essentially cause very large strain rates. Below the theoretical shear stress limit 
the plastic strain rate is zero. Defectless flow is dominant when the normalized 
tensile stress ( o/fi ) is greater than — 9 X 10 -2 , or above the horizontal 
dotted line. This regime of flow will not normally be accessed in films. 

2. Dislocation Glide . Under stresses sufficiently high to cause plastic 
deformation, dislocation glide is the dominant mechanism in ductile materials. 
Dislocation motion is impeded by the presence of obstacles such as impurity 
atoms, precipitates, and other dislocations. In thin films, additional obstacles to 
dislocation motion such as the native oxide, the substrate, and grain boundaries 
are present. Thus, the film thickness d and grain size, / g , may be thought of 
as obstacle spacings in Eq. 9-3. An empirical law for the dislocation glide 
strain rate e 2 as a function of stress and temperature is 

&2 = - A G/kT, (9-30) 

where a 0 is the flow stress at absolute zero temperature, AG is the free energy 
required to overcome obstacles, e 0 is a pre-exponential factor, and kT has the 
usual meaning. 
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3 . Dislocation Climb . When the temperature is raised sufficiently, dislo¬ 
cations can acquire a new degree of motional freedom. Rather than be impeded 
by obstacles in the slip plane, dislocations can circumvent them by climbing 
vertically and then gliding. This sequence can be repeated at new obstacles. 
The resulting strain rate of this so-called climb controlled creep depends on 
temperature and is given by 


at T> 0.3T m ; 

pb , 

£3 " Ai kT° b 1 

(’)■ 

(9-31) 

at T > 0.6 T m \ 

lib | 

e, = A ,— D L \ 

(-)■ 

(9-32) 


Here, D b and D L are the thermally activated grain-boundary and lattice 
diffusion coefficients, respectively, and A 3 and A 4 are constants. 


4. Diffusional Creep . Viscous creep in polycrystalline films can occur by 
diffusion of atoms within grains (Nabarro-Herring creep) or by atomic 
transport through grain boundaries (Coble creep). The respective strain rates 
are given by 


fi 12 { a \ 

£s = 

\i 12 hD b l a 

£(> = A(, ~kf T7d 2 


(9-33) 

(9-34) 


where in addition to constants A s and A 6 , 12 is the atomic volume and 5 is 
the grain-boundary width. It is instructive to think of the last two equations as 
variations on the theme of the Nemst-Einstein equation (Eq. 1-35). The 
difference is that in the present context the applied stress (force) is coupled to 
the resultant rate of straining (velocity). Rather than the linear coupling of i 
and a in diffusional creep, a stronger nonlinear dependence on stress is 
observed for dislocation climb processes. 


In constructing the deformation mechanism map, the process exhibiting the 
largest strain relaxation rate is calculated at each point in the field of the 
normalized stress-temperature space. The field boundaries are determined by 
equating pairs of rate equations for the dominant mechanisms and solving for 
the resulting stress dependence on temperature. 


9.5.3. Relaxation Effects in Metal Films during Thermal Cycling 

An interesting application of strain relaxation effects is found in Josephson 
superconducting tunnel-junction devices (Ref. 23) (These are further discussed 
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Figure 9-14. Deformation mechanism map for Pb-In-Au thin films. (From Ref. 
23). Inset: Schematic cross section of Pb alloy Josephson junction device. (From 
Ref 22). 


in Chapter 14.) A schematic cross section of such a device is shown in the inset 
of Fig. 9-14. The mechanism of operation need not concern us, but their very 
fast switching speeds (e.g., ~ 10“ 11 sec) combined with low-power dissipa¬ 
tion levels (e.g., — 10” 6 W/device) offer the exciting potential of building 
ultrahigh speed computers based on these devices. The junction basically 

o 

consists of two superconducting electrodes separated by an ultrathin 60-A-thick 
tunnel barrier. Lead alloy films serve as the electrode materials primarily 
because they have a relatively high superconducting transition temperature* 
and are easy to deposit and pattern. The thickness of the tunnel barrier oxide is 
critical and can be controlled to within one atomic layer through oxidation of 


*The application described here predates the explosion of activity in YBa 2 Cu,0 7 ceramic 
superconductors (see Chapter 14). 
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Pb alloy films. Fast switching and resetting times are ensured by the low 
dielectric constant of the Pb0-In 2 0 3 barrier film. A serious materials-related 
concern with this junction structure is the reliability of the device during 
thermal cycling between room temperature and liquid helium temperature (4.2 
K) where the device is operated. The failure of some devices is caused by the 
rupture of the ultrathin tunnel barrier due to the mismatch in thermal expansion 
between Pb alloys and the Si substrate on which the device is built. During 
temperature cycling the thermal strains are relaxed by the plastic deformation 
processes just considered resulting in harmful dimensional changes. 

Let us now trace the mechanical history of an initially unstressed Pb film as 
it is cooled to 4.2 K. Assuming no strain relaxation, path a in Fig. 9-14 
indicates that the grain-boundary creep field is traversed from 300 to 200 K, 
followed by dislocation glide at lower temperatures. Because cooling rates are 
high at 300 K, there is insufficient thermal energy to cause diffusional creep. 
Therefore, dislocation glide within film grains is expected to be the dominant 
deformation mechanism on cooling. If, however, no strain relaxation occurs, 
the film could then be rewarmed and the o-T path would be reversibly 
traversed if, again, no diffusional creep occurs. Under these conditions the 
film could be thermally cycled without apparent alteration of the state of stress 
and strain. If, however, a relaxation of the thermal strain by dislocation glide 
did occur upon cooling, then the path followed during rewarming would be 
along b. Because the coefficient of thermal expansion for Pb exceeds that of 
Si, a large tensile stress initially develops in the film at 4.2 K. As the 
temperature is raised, dislocation glide rapidly relaxes the stress so that at 200 
K the tensile stress effectively vanishes. Further warming from 200 to 300 K 
induces compressive film stresses. These provide the driving force to produce 
micron-sized protrusions or so-called hillock or stunted whisker growths from 
the film surface. This manifestation of strain relaxation is encouraged because 
grain-boundary diffusional creep is operative in Pb over the subroom tempera¬ 
ture range. 

It is clear that in order to prevent the troublesome hillocks from forming, it 
is necessary to strengthen the electrode film. This will minimize the dislocation 
glide that originally set in motion the train of events leading to hillock 
formation. Practical methods for strengthening bulk metals include alloying 
and reducing the grain size in order to create impediments to dislocation 
motion. Indeed, the alloying of Pb with In and Au caused fine intermetallic 
compounds to form, which hardened the films and refined the grain size. The 
result was a suppression of strain relaxation effects and the elimination of 
hillock formation. Overall, a dramatic reduction in device failure due to 
thermal cycling was realized. Nevertheless, for these and other reasons, Nb, a 
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much harder material than Pb, has replaced the latter in Josephson junction 
computer devices. 


9.5.4. Hillock Formation 


In multilayer integrated devices hillocks are detrimental because their penetra¬ 
tion of insulating films can lead to electrical short circuits. Hillocks and 
whiskers have been observed to sprout during electromigration (see Section 8.4 
and Fig. 8-15a). Where glass films overlay interconnections, they serve to 
conformally constrain the powered metal conductors. The situation is much 
like a glass film vessel pressurized by an electromigration mass flux. Compres¬ 
sive stresses in the conductor induced by electrotransport can be relieved by 
extrusion of hillocks or whiskers, which sometimes leads to cracking of the 
insulating dielectric overlayer. Interestingly, processes that reduce the com¬ 
pression or create tensile stresses, such as current reversal during electromigra¬ 
tion or thermal cycling, sometimes cause the hillocks to shrink in size. 

From the foregoing examples it is clear that the rate of relieval of compres¬ 
sive stress governs hillock growth. Dislocation flow mechanisms cannot gener¬ 
ally relax stress because the intrinsic stress level present in soft polycrystalline 
metal films is insufficient to activate dislocation sources within grains, at grain 
boundaries, or at the film surface. However, diffusional creep processes can 
relieve the stress. We close this section with the suggestion that diffusional 
creep relaxation of the compressive stress in a film is analogous to the 
outdiffusion of a supersaturated specie from a solid, e.g., outgassing of a strip. 
The rate of stress change is then governed by 


do(x,t) d 2 o(x,t) 

— -- = D --r=—- 

dt dx 2 


(9-35) 


where compressive stress simply substitutes for excess concentration in the 
diffusion equation. If, for example, a film of thickness d contains an initial 
internal compressive stress (r( 0 ) and stress-free surfaces at x = 0 and x - d , 
i.e., ct( 0 , 0 = a(d , /) = 0 , then the stress relaxes according to the equation 


a(x, t) 


4<t(0) “ ( 1 (2n + 1)7r jc ( 2/7 + l) 2 ir 2 Dt ) 

—— n — exp - ?—)• 

(9-36) 


Boundary value problems of this kind have been treated in the literature to 
account for hillock growth kinetics, and the reader is referred to original 
sources for details (Ref. 24). 
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9 . 6 . Adhesion 


9.6.1. Introduction 

The term adhesion refers to the interaction between the closely contiguous 
surfaces of adjacent bodies, i.e., a film and substrate. According to the 
American Society for Testing and Materials (ASTM), adhesion is defined as 
the condition in which two surfaces are held together by valence forces or by 
mechanical anchoring or by both together. Adhesion to the substrate is 
certainly the first attribute a film must possess before any of its other 
properties can be further successfully exploited. Even though it is of critical 
importance adhesion is one of the least understood properties. The lack of a 
broadly applicable method for quantitatively measuring “adhesion” makes it 
virtually impossible to test any of the proposed theories for it. This state of 
affairs has persisted for years and has essentially spawned two attitudes with 
respect to the subject (Ref. 25). The “academic” approach is concerned with 
the nature of bonding and the microscopic details of the electronic and 
chemical interactions at the film-substrate interface. Clearly, a detailed under¬ 
standing of this interface is essential to better predict the behavior of the 
macrosystem, but atomistic models of the former have thus far been unsuccess¬ 
fully extrapolated to describe the continuum behavior of the latter. For this 
reason the “pragmatic” approach to adhesion by the thin-film technologist has 
naturally evolved. The primary focus here is to view the effect of adhesion on 
film quality, durability, and environmental stability. Whereas the atomic 
binding energy may be taken as a significant measure of adhesion for the 
academic, the pragmatist favors the use of large-area mechanical tests to 
measure the force or energy required to separate the film from the substrate. 
Both approaches are, of course, valuable in dealing with this difficult subject, 
and we shall adopt aspects of these contrasting viewpoints in the ensuing 
discussion of adhesion mechanisms, measurement methods, and ways of 
influencing adhesion. 

9.6.2. Energetics of Adhesion 

From a thermodynamic standpoint the work W A required to separate a unit 
area of two phases forming an interface is expressed by 

^ = 7/+7.-7/.- (9-37) 

The quantities y f and y s are the specific surface energies of film and substrate, 
and y fs is the interfacial energy. A positive W A denotes attraction (adhesion), 
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and a negative W A implies repulsion (de-adhesion). The work W A is largest 
when materials of high surface energy come into contact such as metals with 
high melting points. Conversely, W A is smallest when low-surface-energy 
materials such as polymers are brought into contact. When f and s are 
identical, then an interfacial grain boundary forms where 7/+7s>7/ 5 - 
Under these circumstances, y f — y s and y fs is relatively small; e.g., y fs ~ 
(l/3)7 5 in metals. If, however, a homoepitaxial film is involved, then y fs = 0 
by definition, and W A = 2y s . Attempts to separate an epitaxial film from its 
substrate will likely cause a cohesion failure through the bulk rather than an 
adhesion failure at the interface. When the film-substrate combination is 
composed of different materials, y fs may be appreciable, thus reducing the 
magnitude of W A . Interfacial adhesion failures tend to be more common under 
such circumstances. In general, the magnitude of W A increases in the order (a) 
immiscible materials with different types of bonding, e.g., metal-polymer, (b) 
solid-solution formers, and (c) same materials. Measured values of adhesion 
will differ from intrinsic W A values because of contributions from chemical 
interactions, interdiffusional effects, internal film stresses, interfacial impuri¬ 
ties, imperfect contact, etc. 


9.6.3. Film - Substrate Interfaces 

The type of interfacial region formed during deposition will depend not only on 
W A but also on the substrate morphology, chemical interactions, diffusion 
rates and nucleation processes. At least four types of interfaces can be 
distinguished, and these are depicted in Fig. 9-15. 

1. The abrupt interface is characterized by a sudden change from the film 
to the substrate material within a distance of the order of the atomic spacing 

o 

(1-5 A). Concurrently, abrupt changes in materials properties occur due to the 
lack of interaction between film and substrate atoms, and low interdiffusion 
rates. In this type of interface, stresses and defects are confined to a narrow 
planar region where stress gradients are high. Film adhesion in this case will 
be low because of easy interfacial fracture modes. Roughening of the substrate 
surface will tend to promote better adhesion. 

2. The compound interface is characterized by a layer or multilayer struc¬ 
ture many atomic dimensions thick that is created by chemical reaction and 
diffusion between film and substrate atoms. The compounds formed are 
frequently brittle because of high stresses generated by volumetric changes 
accompanying reaction. Such interfaces arise in oxygen-active metal films on 
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Figure 9-15. Different interfacial layers formed between film and substrate: (1) 
abrupt interface; (2) compound interface; (3) diffusion interface; (4) mechanical anchor¬ 
ing at interface. 


oxide substrates or between intermetallic compounds and metals. Adhesion is 
generally good if the interfacial layer is thin, but is poor if thicker layers form. 

3. The diffusion interface is characterized by a gradual change in composi¬ 
tion between film and substrate. The mutual solubility of film and substrate 
precludes the formation of interfacial compounds. Differing atomic mobilities 
may cause void formation due to the Kirkendall effect (Chapter 8). This effect 
tends to weaken the interface. Usually, however, interdiffusion results in good 
adhesion. A related type of transition zone which can strongly promote 
adhesion is the interfacial “pseudodiffusion” layer. Such layers are formed 
when film deposition occurs under the simultaneous ion bombardment present 
during sputtering or ion plating. In this way backscattered atoms sputtered 
from the substrate efficiently mix with the incoming vapor atoms of the film to 
be deposited. The resulting condensate may be thought of as a metastable phase 
in which the solubility of the components involved exceed equilibrium limits. 
The generally high concentration of point defects and structural disorder 
introduced by these processes greatly enhance “diffusion” between materials 
that do not naturally mix or adhere. 

Important examples of interdiffusion adhesion are to be found in polymer 
systems that are widely used as adhesives. In view of the above, it is not 
surprising that interdiffusion of polymer chains across an interface requires 
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that the adhesive and substrate be mutually soluble and that the macro¬ 
molecules or segments be sufficiently mobile. Such conditions are easily met in 
the autoadhesion of elastomers and in the solvent bonding of compatible 
amorphous plastics. 

4. The mechanical interface is characterized by interlocking of the deposit¬ 
ing material with a rough substrate surface. The adhesion strength depends 
primarily on the mechanical properties of film and substrate and on the 
interfacial geometry. A tortuous fracture path induced by rough surfaces and 
mechanical anchoring leads to high adhesion. Mechanical interlocking is relied 
upon during both electroplating and vacuum metalization of polymers. 

9.6.4. Theories of Adhesion (Ref. 25) 

The adsorption theory is most generally accepted and suggests that when 
sufficiently intimate contact is achieved at the interface between film and 
substrate, the surfaces will adhere because of the pairwise interaction of the 
involved atoms or molecules. There is no reason to believe that the forces that 
act in adhesion are any different from those that are functional within bulk 
matter. Therefore, the interaction energy typically follows the behavior de¬ 
picted in Fig. l-8b as a function of separation distance regardless of the type of 
materials or surface forces involved. It is believed that the largest contribution 
to the overall adhesion energy is provided by van der Waals forces (physio- 
sorption). These are classified into London, Debye, and Keesom types depend¬ 
ing, respectively, on whether neither, one, or both of the paired atoms possess 
electric dipoles. Interaction energies between film and substrate atoms typically 
fall off as the sixth power of the separation distance. The resulting forces are 
weak and secondary bonding is said to exist with energies of 0.1 eV per atomic 
pair. In addition to van der Waals forces, chemical interactions (chemisorp¬ 
tion) also contribute to adhesion. Stronger primary covalent, ionic, and metal¬ 
lic binding forces are involved now, and bond energies of 1 to 10 eV can be 
expected. 

For a typical interface containing some 10 15 primary bonds/cm 2 at 1 eV per 
bond, the total energy is 10 15 eV/cm 2 or 1600 ergs/cm 2 . This corresponds to 
typical surface energies of metals. The bonding force can be obtained from the 
bond energy if its variation with separation distance is known. If, for example, 

o 

the adhesion energy drops to zero when the surfaces are parted by some 5 A, 
then the specific adhesion force is F A = (1600 ergs/cm 2 )/5 x 10~ 8 cm or 
3.2 X 10 10 dynes/cm 2 . In contrast, van der Waals adhesion forces are ex¬ 
pected to be an order of magnitude less or roughly 10 9 dynes/cm 2 . Secondary 
bonding forces alone may result in adequate adhesion, but the presence of 
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primary bonds can considerably increase the joint strength. Surface-specific 
analytical techniques such as laser-Raman scattering. X-ray photoelectron 
spectroscopy, and SIMS have yielded definitive evidence that primary interfa¬ 
cial bonding contributes significantly to the intrinsic adhesion. 

Exchange of charge across film-substrate interfaces also contributes to 
adhesion. As a result, electrical double layers consisting of oppositely charged 
sheets develop and exert adhesive forces. The latter, however, are generally 
small compared with physiosorption forces. The situation is like that of a 
parallel-plate capacitor. Chapman (Ref. 25) has estimated that the attractive 
force is Q 2 / 2e 0 per unit area, where Q is the charge density /cm 2 and e 0 is 
the permittivity of free space. If Q = 10 n -10 13 electronic charges/cm 2 then 
the resulting attractive forces are 10 4 -10 8 dynes/cm 2 . These are small com¬ 
pared with other force contributions to adhesion. 

Theories do not always provide guidelines on how to practically achieve 
good film adhesion in practice. Conventional wisdom, for example, suggests 
using very clean substrates. This is not necessarily true for the deposition of 
metals on glass substrates because optimum adhesion appears to occur only 
when the metal contacts the substrate through an oxide bond. Thus A1 adheres 
better when there is some A1 2 0 3 present between it and the glass substrate. It 
is not surprising that strong oxide formers adhere well to glass. Intermediate 
oxide layers can be produced by depositing metals with large heats of oxide 
formation such as Cr, Ti, Mo, and Ta. Reactions of the type given by Eq. 1-16 
proceed at the interface resulting in good adhesion. Conversely, the noble 
metals such as Au and Ag do not form oxides readily and, accordingly, adhere 
poorly to glass, a fact reflected in low film stresses (Table 9-2). To promote 
adhesion, it is common practice, therefore, to first deposit a few hundred 
angstroms of an intermediate oxygen-active metal to serve as the “glue” 
between the film and substrate. This is the basis of several multilayer metal¬ 
lization contact systems such as Ti-Au, Ti-Pd-Au and Ti-Pt-Au. After 
deposition of the intermediate glue layer, the second film should be deposited 
without delay, for otherwise the glue metal may oxidize and impede adhesion 
of the covering metal film. 

9.6.5. Adhesion Tests 

Although there are no ways to directly measure interfacial atomic bond 
strengths, numerous tests characterize adhesion practically. These tests have 
been recently reviewed by Steinmann and Hintermann (Ref. 26), and Valli 
(Ref. 27). Essentially two types of tests are distinguished by whether tensile or 
shear stresses are generated at the interface during testing. 






Figure 9-16. Adhesion test methods: (a) pull-off test; (b) adhesive tape test; (c) scratch test. (a. and b. from Ref. 26, c. from Ref. 14). 
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9.6.5.1. Tensile-Type Tests . The simplest of these include direct pull-off 
as well as so-called topple tests and both are used primarily for coatings. As 
Fig. 9-16a indicates, force is applied to a member glued or soldered to the 
coating, and the resultant load to cause interfacial separation is then measured. 
Misalignment problems associated with normal pulling are partially overcome 
by applying a torque in the topple tests. The value of F A is equal to F y the 
applied force at separation divided by the contact area A . 

Acceleration tests also generate tensile stresses in the coating but without the 
disadvantage of glues and mechanical linkages. In the ultracentrifugal method a 
coated cylinder is levitated electromagnetically and spun at ever-increasing 
speed until the coating debonds from the substrate. 

Pulsed lasers have also been used to measure adhesion forces. When the 
back of the substrate is exposed to the laser pulse, successive compressive and 
tensile shock waves rapidly flex the substrate backward and then forward, 
detaching the coating in the process. Adhesion is characterized by the energy 
absorbed per unit area. 

9. 6.5. 2. Shear-Type Tests - The adhesive tape test developed over a half 
century ago provides the simplest and quickest qualitative measure of film or 
coating adhesion. Schematically indicated in Fig. 9-16b, the test can distin¬ 
guish between complete lifting, partial lifting, or complete adhesion with a 
little bit of discrimination. The test can also be made semiquantitative by 
controlling the angle of pull and the rate of pulling. With improved adhesives 
the force required to peel the tape is measured as a function of angle; the force 
extrapolated to zero angle is a measure of the adhesion. In such tests it is 
necessary that the tape-film bond be stronger than the film-substrate bond. 

9.6.5.3. Scratch Tests . The scratch test shown schematically in Fig. 9-16c 
is a widely used means of evaluating the adhesion of films. The test consists of 
drawing a stylus or indenter of known radius of curvature over a film or 
coating under increasing vertical loads. Resultant scratches are observed under 
an optical or scanning electron microscope in order to estimate the minimum 
or critical load required to scribe away the film and leave a clear channel or 
visible substrate behind. The elastoplastic deformation is complicated, how¬ 
ever, and films can be thinned and appear translucent while still adhering to 
the substrate. Alternatively, films can remain opaque when detached. Com¬ 
mercial equipment is available to enable the critical load to be determined on 
the basis of a single scratch. This is accomplished by ramping the indenting 
load between set limits, followed by visual examination of the scratch to 
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determine the critical load F c that just causes adhesion failure. The scratching 
process is also accompanied by the emission of acoustic signals that are small 
in magnitude when the film adheres at low loads. The onset of large acoustic 
emission caused by shearing or fracture at the film-substrate interface has 
been taken as a measure of the critical de-adhesion load, thus obviating the 
need for microscopic examination. Theoretical analyses relating the critical 
load, stylus geometry, and scratch dimensions to the specific adhesion force 
have been made. One such relation is 

F a = KH v F c /ttR\ (9-38) 

where the magnitude of coefficient K depends on the model details (K can 
range from 0.2 to 1), H v is the Vickers hardness (see page 562), and R is the 
radius of the stylus tip. 

At present there is little quantitative agreement in F A values obtained from 
different adhesion test methods. Rather, individual tests are well suited to 
internal comparisons of the same film-substrate combination prepared in 
different ways. 



Exercises 



1. Identical metal films of equal thickness, deposited on both sides of a thin 
substrate strip are found to possess a residual tensile stress. One of the 
films is completely removed by sputter-etching. Qualitatively describe 
how the remaining film-substrate combination deforms or bows. 

2 . Stress fields exists around dislocations resulting in matrix distortions 
shown in Fig. 1-6. 

a. A row of edge misfit dislocations of the same sign (orientation) lies 
within a thin film close to and parallel to the substrate interface. 
Comment on the internal stress in the film. 

b. How would the film stress differ if the dislocations were screw type? 

c. Due to annealing, some dislocations climb vertically and some disap¬ 
pear. How does this affect internal stress? 

3 - It is desired to grow epitaxial films of GaSb on AlSb substrates by 
deposition at 500 °C. Refer to Table 7-1. 

a. What is the expected lattice mismatch at 500 °C? 

b. What thermal stress can be expected in the film at 20°C if E GiiSb = 91.6 
GPa and ^ 3 $^ = 0.3? 
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4- Suppose S = Kd n describes the behavior of the stress (o f ) x thickness 
(d) of a film as a function of d. (K and n are constants.) Contrast the 
variation of film stress and instantaneous stress versus d. 

5 . a. Consider the strain relaxation of a parallel spring-dashpot combina¬ 

tion under constant loading and derive Eq. 9-29. 
b. The intrinsic stress in a Si0 2 film is 10 10 dynes/cm 2 . If the coefficient 
of viscosity of Si0 2 film is t/(T) = 1.5 X 10 _8 exp E V /RT (E v - 137 
kcal/mole) over the temperature range 900-1500 °C, how long will it 
take the film to reach half of its final strain at 1000 °C. Assume 
E = 6.6 x 10" dynes/cm 2 , and the units of are Poise. 

6. An engineer wishes to determine whether there will be more bow at 20 °C 
in a Si wafer with a 1-^m-thick Si0 2 film, or with a l-ju,m-thick Si 3 N 4 
film. Both films are deposited at 500 °C on a 0.5 mm/'Si wafer. At the 
deposition temperature the intrinsic stresses are -3 X 10 9 dynes/cm 2 for 
SiO z and -6 x 10 9 dynes/cm 2 for Si 3 N 4 . If the respective moduli are 
£ si0 . = 7.3 X 10 11 dynes/cm 2 , £ SiiNj = 15 X 10 11 dynes/cm 2 , and the 
thermal expansion coefficients are a Si02 = 0.55 X TO -6 °C“', a S \^ A = 
3 X 10“ 6 °C _1 , calculate the radius of curvature for each wafer. [Note: 
£ Si = 16 X 10" dynes/cm 2 , a Sl =4x 10" 6 °C~\] 

Assume Poisson’s ratio for film and substrate is 0.3. What would the 
radii of curvature be in a 15-cm-diameter wafer? What is the difference in 
height between the edge and center of the wafer? 

7- When sequentially deposited films are all very thin compared with the 
substrate, each film imposes a separate bending moment and separate 
curvature. Since moments are additive, so are the curvatures. 

a. Show that 

1 1 1 1-^6 
— + — + ’ * * H-=- — ((j\d, +cr 9 <7 2 + * ■ ■ + o n d ). 

*1 *2 Rn E S d 2 ^ 

where 1,2, . . . , n denotes the film layer, and a n and d n the film 
stress and thickness. 

o 

b. A 5000-A-thick A1 film is deposited stress free on a 12.5-cm-diameter 
Si wafer (0.5 mm thick) at 250 °C such that there is no stress 
relaxation on cooling to 20 °C. Next, the Al-Si combination is heated 
to 500 °C where A1 completely relaxes. A 2-^m-thick Si 3 N 4 film is 
then deposited with an intrinsic compressive stress of 700 MPa. What 
is the final radius of curvature after cooling to 20 °C? Note the 
following materials properties. 
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Si Al S i ^ N i 4 


E{ GPa) 160 66 150 

ot °C' 1 4 x I0“ 6 23 x 10” 6 3 x 10 -6 


Assume v = 0.3 for all materials. 


8 - Unlike the usual //?/>7-film-//?/c/:-substrate combination treated in this 
chapter, consider thin-film multilayers. For adjacent Films 1 and 2 the 
corresponding Film thicknesses, moduli, and w/istrained lattice parameters 
are d } , E x , <7 0 (1) an ^ d 2 , E 2 , a 0 (2). There is a common lattice parame¬ 
ter, a 0 , at the interface between Films. 


a. What is the strain in each Film? 

b. What are the corresponding stresses? 

c. If the forces are equilibrated, show that 


a o ~ fl o0) tf o(2) 


E 2 d 2 \ / E 2 d 2 

i + ii A< 2 > + 


9 . In Fig. 14-17 the structure of the (250 A) Si—(75 A) Ge 04 Si 06 superlattice 
is shown. The [100] moduli for Ge and Si are E Gc = 141 GPa, E Sl = 181 
GPa and # 0 (Ge) = 5.66 A, fl 0 (Si) = 5.43 A are the corresponding lattice 
parameters. If the properties of Ge 04 Si 06 are assumed to be derived from 
weighted composition averages of pure component properties, find 


a. the common interfacial (in-plane) lattice parameter using the results of 
the previous problem. 

b. the strains and stresses in the Si and Ge 04 Si f)f> layers. 

c. the strained lattice parameters normal to the film layers. 


Assume Poisson’s ratio is 0.37. 


10, Consider a substrate of thickness d s containing deposited Films of thick¬ 
ness dj on either side that are uniformly stressed in tension to a level of 
a f . The substrate is assumed to be uniformly compressed. Film and 
substrate have the same elastic constants. 

a. Determine the substrate stress, assuming force equilibrium prevails. 

b. Show that under the foregoing conditions the net moment with respect 
to an axis at the center of the substrate vanishes. 

c. One Film is totally annealed so that its stress vanishes. The substrate 
and other Film are unaffected in the process. What is the net force 
imbalance or resultant force? What is the net moment imbalance or 
resultant moment? 
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d. In the absence of external constraints the film-substrate will elastically 
deform to find a new equilibrium stress distribution with zero resultant 
force and moment. A uniform force as well as a moment (arising from 
a linear force distribution through the film-substrate cross section) are 
required to counter the mechanical imbalance of part (c). What is the 
stress contribution to the remaining film from the uniform force? What 
is the maximum stress contribution to the remaining film from the 
moment? 

e. What is the new maximum stress in the remaining film and what sign 
is it? 

11. Voids and porosity are sometimes observed in abrupt, compound, diffu¬ 
sion, and mechanical interfaces between films and substrates. Distinguish 
the sources of these defects at these interfaces. Which interfaces are likely 
to contain microcracks? Why? 
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Electrical and Magnetic 
Properties of Thin Films 


10.1. Introduction to Electrical Properties of Thin Films 


10.1.1. General Considerations 

Electrical properties of thin films have long been of practical importance and 
theoretical interest. The solid-state revolution has created important new roles 
for thin film electrical conductors, insulators, and devices. What was once 
accomplished with large discrete electrical components and systems is now 
more efficiently and reliably achieved with microscopic thin-film-based inte¬ 
grated circuit chips. Regardless of the class of material involved, its physical 
state or whether it is in bulk or film form, an electric current of density J 
(amps/cm 2 ) is said to flow when a concentration of carriers n (number/cm 3 ) 
with charge q moves with velocity v (cm/sec) past a given reference plane in 
response to an applied electric field & (V/cm). The magnitude of the current 
flow is expressed by the simple relation 

J = nqv. (10-1) 

For most materials, especially at small electric fields the carrier velocity is 
proportional to $ so that 

v = iiS\ (10-2) 
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The proportionality constant or velocity per unit field is known as the mobility 
p. Therefore, 

J= nqiii , (10-3) 

and by Ohm’s law {J = o$) the conductivity a or reciprocal of the resistivity 
p is given by 

a = 1/p = nqp. (10-4) 

Quantitative theories of electrical conductivity seek to define the nature, 
magnitude, and attributes of the material constants in these equations. Corol¬ 
lary questions revolve about how n and v or p vary as a function of 
temperature, composition, defect structure, and electric field. An alternative 
complementary approach to understanding the response of materials to elec¬ 
trical fields involves electronic band structure considerations that, as noted in 
Chapter 1, have successfully modeled property differences. Comprehensive 
descriptions of conduction integrate what might be termed the “charge carrier 
dynamics” approach with the band structure viewpoint. The former is more 
intuitive and will be adopted here for the most part, but resort will also be 
made to band diagrams and concepts. 

This chapter focuses primarily on the electrical conduction properties of thin 
metal, insulating, and superconducting films. Almost half of the classic 
Handbook of Thin Film Technology , edited by Maissel and Glang, is 
devoted to a treatment of electrical and magnetic properties of thin films. 
Though dated, this handbook remains a useful general reference for this 
chapter. Much of what is already known about bulk conduction provides a 
good basis for understanding thin-film behavior. But there are important 
differences that give thin films unique characteristics and these are enumerated 
here: 

1 . Size effects or phenomena that arise because of the physically small 
dimensions involved— Examples include surface scattering and quantum 
mechanical tunneling of charge carriers. 

2. Method of film preparation —It cannot be sufficiently stressed that the 
electrical properties of metal and insulator films are a function of the way 
they are deposited or grown. Depending on conditions employed, varying 
degrees of crystal perfection, structural and electronic defect concentra¬ 
tions, dislocation densities, void or porosity content, density, grain mor¬ 
phology, chemical composition and stoichiometry, electron trap densities, 
eventual contact reactions, etc., result with dramatic property implications. 
Insulators (e.g., oxides, nitrides) are particularly prone to these effects and 
metals are less affected. 
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3. Electrode effects— Frequently the substrate and a subsequently deposited 
conducting film become the electrodes for the film in question that is 
sandwiched in between. In general, insulating films cannot be considered 
apart from the electrodes that contact their surfaces. The electrical response 
of structures containing insulator (I) or oxide (O) films between metals 
(MIM), semiconductors (SOS), and mixed electrodes (MIS, MOS) is 
strongly influenced by the specific metal or semiconductor electrode mate¬ 
rials employed. Interfacial adhesion, stress, interdiffusion, incorporated or 
adsorbed impurities, are some of the factors that can alter the character of 
charge transport at an interface. 

4. Degree of film continuity— conduction mechanisms in discontinuous, 
island structure films differ from those in continuous films. 

5. Existence of high electric field conduction phenomena— Moderate volt¬ 
ages applied across very small dimensions conspire to make high field 
effects readily accessible in films. 

6. High chemical reactivity—films are susceptible to aging or time-depen¬ 
dent changes in electrical properties due to corrosion, absorption of water 
vapor, atmospheric oxidation and sulfidation, and low-temperature solid- 
state reactions. 

In virtually all cases, thin metal films are more resistive, while insulating 
films are more conductive than their respective bulk counterparts. For metals 
the differences between film and bulk electrical properties are relatively small; 
in insulators the differences can be huge. Why this is so will unfold in the 
ensuing pages. 


10.1.2. Measurement of Film Resistivity 

A number of techniques have been employed to measure electrical properties 
of thin films. Some are adaptations of well-known methods utilized in bulk 
materials. For insulating films, where current flows through the film thickness, 
electrodes are situated on opposite film surfaces. Small evaporated or sputtered 
circular electrodes frequently serve as a set of equivalent contacts; the substrate 
is usually the other contact. If charge leaks along the surface from contact to 
contact, circumventing through-film conduction, then a guard electrode is 
required. 

For more conductive metal and semiconductor films, it is common to place 
all electrodes on the same film surface. Such measurements employ four 
terminals-two to pass current and two to sense voltage. Several contact 
configurations shown in Fig. 10-1 are suitable for this purpose. Through the 
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a. 



Figure 10-1. Techniques for measuring thin-film electrical resistance, (a) Four- 
terminal method for conducting stripes. Current is passed through outer terminals, and 
voltage is measured across inner terminals, (b) Four-point probe method for measuring 
sheet resistance, (c) Van der Pauw method for measuring resistivity of arbitrarily 
shaped film. 

use of lithographic patterning methods (Chapter 14), long stripes can be 
configured with outer current and inner voltage leads that are extensions of the 
film itself (Fig. 10-la). In this way contact resistance problems are eliminated. 
Independent measurements of voltage and current yield the film resistance 
through Ohm’s law, and the resistivity if film dimensions are known. 

A very common way to report values of thin-film resistivity is in terms of 
sheet resistance with units “ohms per square.” To understand this property 
and the units involved, consider the film of length /, width w, and thickness d 
in Fig. 10-2. If the film resistivity is p, the film resistance is R = p// wd . 
Furthermore, in the special case of a square film (/ = w), 

R = R s = p / d ohms/D, (10-5) 

where R s is independent of film dimensions other than thickness. Any square, 
irrespective of size, would have the same resistance. As an example, consider 
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Figure 10-2. Thin-film conductor with length /, width w, and thickness d. 

a film stripe measuring 3 /xm x 30 /xm with R s = 15 fi/D. The overall 
stripe resistance is 10 X 15 = 150 fi because there are 10 squares (3 /im X 
3 /xm) in series. 

A very convenient way to measure the sheet resistance of a film is to lightly 
press a four-point metal-tip probe assembly into the surface as shown in Fig. 
10-lb. The outer probes are connected to the current source, and the inner 
probes detect the voltage drop. Electrostatic analysis of the electric potential 
and field distributions within the film yields 

R s = KV/I, (10-6) 

where A" is a constant dependent on the configuration and spacing of the 
contacts. If the film is large in extent compared with the probe assembly and 
the probe spacing large compared with the film thickness, K = 7r/ln2 = 4.53. 
Otherwise correction factors must be applied (Ref. 1). Four-point probe 
assemblies are available commercially with square as well as the more com¬ 
mon linear contact arrays. 

Van der Pauw (Ref. 2) has devised a perfectly general method to determine 
the resistivity of a film using four probes located arbitrarily at points 
A, B, C, D on the surface (Fig. 10-lc). Two sets of current-voltage measure¬ 
ments using alternate contacts are required. The value of p can be extracted 
from the formula 


Td V CD 7 rd V DA 

exp-+ exp-= 1. 

P I ab P Ibc 


(10-7) 


10,2, Conduction in Metal Films 


10.2.1. Matthiessen’s Rule 

Much can be learned about electrical conduction in metals from Matthiessen’s 
rule. Originally suggested for bulk metals, it is also valid for thin metal films. 
It simply states that the various electron scattering processes that contribute to 
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TEMPERATURE (K) 

Figure 10-3. Schematic electrical resistivity dependence as a function of temperature 
for a metal. Thermal, impurity, and defect contributions are shown. 

the total resistivity ( p T ) of a metal do so independently and additively; i.e., 

Pt = ^Th "T Pi T Pp (10-8) 

The individual thermal, impurity, and defect resistivities are p Th , p n and p D , 
respectively. Electron collisions with vibrating atoms (phonons) displaced from 
their equilibrium lattice positions are the source of the thermal or phonon (p-^) 
contribution, which increases linearly with temperature. This causes the well- 
known positive temperature coefficient of resistivity, an unambiguous sign of 
metallic behavior. Impurity atoms, defects such as vacancies, and grain 
boundaries locally disrupt the periodic electric potential of the lattice. Due to 
atomic valence and size differences at these singularities, electrons are effec¬ 
tively scattered. However, the contributions from p, and p D are temperature 
independent, as shown in Fig. 10-3 if the concentration of impurities and 
defects is low. 

As an application of Matthiessen’s rule, consider the problem of assessing 
the purity and defect content of “pure” metal films. A measure of these 
properties is the residual resistivity ratio or RRR. The latter is defined by 
RRR = p r (300 K)/p r (4.2 K) and is readily determined experimentally by 
measuring the resistivity at the two indicated temperatures. For relatively pure 
annealed metal films at 300 K, Pxh > P/ + P D - At 4.2 K, however, lattice 
vibrations are frozen out and thermal scattering is dramatically reduced 
compared to impurity and defect scattering; i.e., p, + p D > p Th . Therefore, to 
a good approximation, 

RRR ~ p Th (300 K)/p, + p D { 4.2 K), (10-9) 

and it is clear that the purer and more defect-free the film is, the higher the 
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ratio is. Resistivity ratio determinations of sputtered films as well as sputtering 
target metals have been a useful way to assess their quality. Measured values 
of RRR broadly range from below 10 to several thousand. For extremely pure 
metal films, chemical analysis methods fail because of limited sensitivity, and 
the value of RRR is the only practical means available for determining purity. 

An alternative formulation of Matthiessen’s rule in terms of the respective 
mean free-electron lengths (X) between collisions has the form 

1/A r = l/X Th + 1/X,+ 1/X^. (10-10) 

This result stems from the intuitive direct proportionality between o and X, so 
that p, oc 1/X,. Theory has suggested that \ T can be hundreds to thousands of 
angstroms—lengths difficult to reconcile with classical models of vibrating 
atoms. Clearly quantum effects are involved here. In films of thickness d that 
are sufficiently thin so that d < \ T , the possibility arises that a new source of 
scattering—that due to the film surfaces—can increase the measured film 
resistivity. It is to this subject that we now turn our attention. 


10.2.2. Electron Scattering from Film Surfaces 

The problem of electron scattering from thin metal film surfaces has been of 
interest for almost a century; even today the subject is not free of controversy. 
Critical issues revolve about the nature of the film surface and what happens 
when an electron fails to execute the full mean-free path because its motion is 
interrupted through collision with the film surface. The electrons then undergo 
either specular or diffuse scattering. In specular or elastic scattering the 
electron reflects in much the same way a photon does from a mirror. In this 
case one can imagine removing the surface and doubling the film thickness (or 
tripling it for two surface reflections, etc.). The electron now continues on an 
imaginary straight path to complete the path it would have in bulk, finally 
scattering at a point interior to the extended surface. If this happens, the film 
resistivity is the same as in the bulk, and there is no film thickness effect on 
resistivity. When scattering is totally nonspecular, or diffuse (inelastic), the 
electron mean-free path is terminated by impinging on the Film surface. After 
a surface collision the electron trajectory is independent of the impingement 
direction, and the subsequent scattering angle is random. The resistivity rises 
because fewer electrons flow through the reference plane registering current 
flow. 

Reducing the geometry in one dimension creates an additional film contribu¬ 
tion, i.e., Xy, to be added to the other X in Eq. 10-10; if Xy is smallest, then it 



458 


Electrical and Magnetic Properties of Thin Films 



Figure 10-4. Diffuse scattering of electrons from film surface in Thomson model. 


will dominate the effective film resistivity. Thomson (Ref. 3), in 1901, 
provided a simple way to visualize this size effect in terms of classical physics. 
Consider a metal whose bulk conductivity is characterized by which is 
larger than the film thickness d, as shown in Fig. 10-4. For an electron 
starting at point P, a distance z from the film surface moving in a direction 
making an angle 9 with the vertical, the effective mean-free path is variable, 
depending on 0 , and is given by 


X = 


d - z 
cos 9 

\) * 

-z 


0 < 0 < 0 , , 

d - z 

0, < 9 < 9 2 cos 9 j = ■■—, (10-11) 

N) 


cos 9 


9 2 < 9 < 7T 


cos 9 2 = 


z 


The mean value of X (i.e., \ f ) is obtained by averaging X over all angles for 
all distances z. Therefore, 

1 rd /■* 

\ f = — I dz X sin 9d9. (10-12) 

2d J$ J 0 


Breaking the total limits of integration into the three angular wedges dictated 
by Eq. 10-11, we obtain, after integration, 


d j Xo 3 
X,= - In— + - 
s 2 l d 2 


(10-13) 


Finally, the film conductivity a f and resistivity p f relative to bulk values are 
given by 


°f _ fto 
a o Pf 



(10-14) 


where k = d /\. Clearly, as d or k shrinks, X^ is diminished and p f rises; a 
size effect is exhibited. 
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A more accurate quantum theory of thin-film conductivity was developed by 
Fuchs in 1938 (Ref. 4) and elaborated upon in the ensuing half-century by 
other investigators, most notably Sondheimer (Ref. 5). The Fuchs-Sondheimer 
(F-S) theory removed the shortcomings in the Thomson development by 
considering the quantum behavior of the free electrons, the statistical distribu¬ 
tion of their X values in bulk, and the fact that many electron mean-free paths 
originate at the film surface. The specific details of the calculation for the film 
resistivity are beyond our scope here, but the exact solution to the film 
conductivity (resistivity) according to Fuchs for pure diffuse scattering is 

p ° i 3 r ■ 

— = — = 1-/ sin <£ cos </>— 

ff 0 p f 4 J 0 d 

(10-15) 


1 - exp - 


Xq cos (f) 


d<t>. 


For very thin films (\ 0 > d or k < l)a useful approximation is 

— = — - — k (in— + 0.423), (10-16) 

*o Pf 4 l *c ) K ' 

a result that preserves the character of the Thomson equation. In thicker films 
or coatings (k > 1), 

o f jo 0 = p 0 /p f ~ 1 - 3/8 k. (10-17) 

A more realistic model of scattering envisions an admixture of both specular 
and diffuse contributions, with P being the fraction of surface scattering 
events that are specular. Under such circumstances approximate formulas for 
thin and thick films are, respectively (Ref. 6), 

— = — = -k(1 + 2P) (in— + 0.423), (10-18) 

<j 0 P/ 4 \ k I 



The results are plotted as p f /p 0 vs. log k in Fig. 10-5a for various values of 
P. For purely specular scattering (P = 1), p f =p Q irrespective of film 
thickness or k. In contrast, for diffuse scattering (P = 0), the extent that p f 
exceeds p Q is solely dependent on k . Maximum resistivity enhancements are 
attained for the largest \ values (achieved at the lowest temperatures) and the 
smallest d values. Experimentally, however, it is difficult to maintain continu¬ 
ous films below about k = 0.05. 
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Similar expressions have been developed for other related transport proper¬ 
ties in thin films such as the temperature coefficient of resistivity, magnetore¬ 
sistance, Hall effect, and assorted thermoelectric effects (Refs. 5-7). 


10.2.3. Grain-Boundary (GB) Scattering 


Grain boundaries, across which crystal orientations change, are expected to be 
effective electron scatterers and behave like film interfaces in this regard. As 
the crystallite size becomes smaller than the electron mean-free path, a 
contribution to the film resistivity from GB scattering arises. Mayadas and 
Shatzkes (M-S) (Ref. 8) have treated this effect, assuming the average grain 
diameter is equal to the film thickness and that only GBs whose normals lie in 
the film plane contribute to the scattering. The theory predicts that the GB 
resistivity p gh is essentially given by 


Pgb 



-p 4- (3 2 - ft 3 \r\ 



( 10 - 20 ) 


The new parameter, /3, related to the relaxation time for GB scattering, 
depends on the grain geometry and its scattering power. Explicitly, 


where D is the average grain diameter and R (0 < R < 1) is a GB electron 
reflection parameter. In the M-S theory, R plays a similar role to P in the 
F-S equations. If R = 1, the film resistivity is infinite because electrons are 
reflected back to where they originated. In addition, R depends on GB type, 
impurity adsorption at GBs, and the intergrain geometry and radius of curva¬ 
ture. Analysis of experiments have revealed that values for R are 0.24 for Cu 
and 0.17 for Al. 

The complex picture that emerges is that the film resistivity arises from 
three scattering mechanisms: one due to bulk scattering (phonons, impurities, 
and point defects), a second due to the film surfaces, and a third due to GBs. 
Comparison of the F-S and M-S theories is made in Fig. 10-5b when D = d. 
A significant conclusion is that it is difficult to separate surface from GB 
scattering effects. Film-thickness-dependent resistivity effects in polycrys- 
talline films could very well be due to GB rather than surface scattering. 


10.2.4. Comparison with Experiment 

A detailed review of the experimental data to date led one critic in 1983 to 
conclude “ . . .there are virtually no studies on the resistivity of thin metal 
films from which useful values of /o 0 \) (or p f and P) may be deduced” (Ref. 
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9). Among the reasons for this pessimistic assessment are the following: 

1. The structure and morphology of the films have only rarely been character¬ 
ized. 

2. Each film interface possesses a different surface roughness and degree of 
specularity. These factors have been taken into account in more sophisti¬ 
cated theories by adding new parameters which are difficult, if not impossi¬ 
ble, to compare with experiment. 

3. The role of scattering from lattice defects has not been established. 

4. Electron scattering from grain boundaries has not always been taken into 
account (especially in the early work) despite the fact that polycrystalline 
rather than epitaxial films are usually involved. 
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Figure 10-6. Temperature dependence of resistivity of CoSi 2 films. The 125-A and 
197-A films are epitaxial. The 1100-A film is polycrystalline. (From Ref. 10). 




10.2. Conduction in Metal Films 


463 


The critic might have been more charitable in assessing a more recent study 
(Ref. 10) designed to critically test the extent of specular scattering in metallic 
CoSi 2 films. Epitaxial films were prepared by first depositing Co on atomi¬ 
cally clean Si wafers under ultrahigh-vacuum conditions. This was followed by 
vacuum annealing to promote silicide formation. Film thickness and stoichiom¬ 
etry were determined to an accuracy of - 5% by RBS measurements. 
Cross-sectional TEM methods revealed that the CoSi 2 -Si interface was nearly 
atomically perfect (Fig. 1-4), whereas the outer surface was extremely smooth. 
Independent low-temperature magnetoresistance measurements indicated that 
Xq was 970 A. Resistivity data recorded as a function of thickness and 
temperature are shown in Figs. 10-5a and 10-6. They show little dependence 
on thickness down to 60 A. An average degree of specularity of about 90% 
from both the free surface and CoSi 2 -Si interface is suggested. Few data on 
metal films exhibit such a small size effect or high specularity for the d /\ 
range explored. Although these results cannot be generalized to other metal 
film systems, the experimental approach, and film preparation and characteri¬ 
zation techniques are worthy of emulation in future studies of this kind. 


10.2.5. Thin-Film Resistors 

Applications for thin- and thick-film resistors are numerous and varied. They 
include microelectronic circuits of all kinds, hybrid circuits consisting of 
combinations of integrated circuits and discrete components, electromechanical 
sensors (e.g., strain gauges), chemical sensors, heating elements, etc. To meet 
these needs, a broad range of R s (e.g., 10-1000 Q/ □) corresponding roughly 
to p ranging from 50 to 2000 ptfi-cm, is required. In resistor design, film 
length, width, thickness, and R s values must be optimized. Clearly, high 
resistance values necessitate long, narrow, and thin geometries fabricated from 
high-/^ materials. On the other hand, applications such as interconnections 
and semiconductor contacts require very low R s values. In addition, resistors 
must frequently possess a low temperature coefficient of resistivity (TCR) 
(< 100 ppm/°C) and be stable with respect to environmental degradation. 
Some of the common materials together with their electrical properties are 
listed in Table 10-1. The negative values of TCR mean that the materials, e.g., 
Ta, Cr-Si, are not truly metallic when sufficient O or N is incorporated during 
processing, such as reactive evaporation or sputtering. Among the more 
noteworthy materials in this table are Ta and In 2 0 3 :Sn. Together with Ta 2 0 5 , 
a common capacitor dielectric, it has been possible to completely base a 
thin-film RC circuit technology on Ta. Indium-tin oxide (ITO) films have the 
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Table 10-1. Electrical Properties of Thin- and Thick-Film Resistor Materials 


Material 

Resistivity 

TCR 

TCR Range 


/ifl'Cm 

ppm/°C 

°C 

Pd-Ag 

38 

±50 

0 to 100 

Ni(80)Cr(20) 

110 

±85 

-55 to 100 

Ni(76)Cr(20)Al(2)Fe(2) 

133 

± 5 

- 65 to 250 

Ta (a-BCC) 

25-50 

+ 500 to + 1800 


TaN* 

' 250 

-100 


Cr-SiO(lO) 

- 400 

-300 

200 °C 

In 2 0 3 :Sn 

- 10 2 -I0 4 



Cr-SiO(40) 

- 3500 

-300 

200 °C 

Sn0 2 (undoped) 

- 10 4 

- 400 to - 900 


Sn0 2 (doped) 

~ 10 2 -10 1 




•Values depend strongly on composition. 
From Ref. 1 1. 


interesting property of being transparent conductors. They have a number of 
important uses and are discussed more fully in Section 11.2.5. 

Metal oxide mixtures are known as cermets and can have widely varying p 
and TCR values, depending on metal content (Ref. 12). To a large extent, the 
laws of mixtures govern whether more conducting or insulating behavior 
dominates. An important class of these materials has found wide use in 
thick -film microcircuitry applications. Thick-film resistor compositions consist 
of a resistive component, a glassy phase that acts as the binder and an organic 
suspension medium. These materials in the form of inks or pastes are applied 
by a silk-screen-like process to ceramic substrates and sintered at high tempera¬ 
tures, thus driving off the organics. The active resistor ingredient is usually 
one of the following: Pd-PdO-Ag, precious metals other than Pd and Ag, 
RuO, T1 2 0 3 , ln 2 0 3 , Sn0 2 as well as W-Ta carbides, and Ta-Ta nitrides. 
Compositions are available having fired sheet resistivities ranging from 1 Q / □ 
to 10 MO/n and TCR values from -1-500 ppm/°C to -800 ppm/°C. 
Typical resistors are 20-30 ^m thick and have somewhat wider widths. 


10.3. Electrical Transport in Insulating Films 


10.3.1. Introduction 

Insulating films are required to electrically isolate conducting components or 
devices as well as serve as the dielectric within capacitors. Examples of such 
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materials include oxides such as Si0 2 , A1 2 0 3 , and Ta 2 0 5 and nitrides such as 
Si 3 N 4 and AIN. The traditional picture of an insulator is a material that 
possesses very few free charge carriers at practical temperatures. This is a 
consequence of the large energy band gap. In practice, however, insulating 
films are generally amorphous and the usual model of sharply defined energy 
bands can not be readily applied; rather, fuzzy tails arise both at the top of the 
valence band and bottom of the conduction band. These extend into the gap 
and overlap to form a continuous electron state distribution. When this happens 
insulators assume semiconductor-like properties. Due to structural imperfec¬ 
tions there may actually be a relatively high density of charge carriers. They 
tend to be localized or trapped at these centers for long times, and the 
insulating behavior stems from such carriers having very low mobilities. 

A rich array of electrical conduction phenomena is displayed by insulating 
films when sandwiched between combinations of metal and semiconductor 
electrodes. Under the influence of an applied electric field, electrons, holes, 
and ions may migrate giving rise to measurable currents. The identification of 
the dominant transport mechanism—the one that is rate-limiting—is the key to 
understanding the relationships between the material parameters of the insula¬ 
tor and contacts, and the resulting current-voltage characteristics. Two broad 
categories of rate-limiting transport mechanisms have been identified (Refs. 
13-15). 

/. Barrier-Limited . These mechanisms are operative in the vicinity of the 
contact/insulator interface. Conduction is limited by the transfer of charge 
from the contact to the insulator; once charge is injected it has little difficulty 
migrating to the other electrode. Schottky emission and tunneling are the most 
important examples of barrier-limited conduction. 

2. Bulk-Limited . In this case sufficient numbers of charge carriers are 
injected into the insulator conduction band by Schottky emission or tunneling. 
However they experience difficulty in reaching the other electrode because of 
bulk transport limitations. Examples include space charge-limited, intrinsic, 
and Poole-Frenkel conduction mechanisms. 

Table 10-2 enumerates some barrier- and bulk-limited conduction mecha¬ 
nisms. The specific theoretical current (7)-voltage or field (£) characteris¬ 
tics, together with explicit material constants that can be extracted from 
measurement, are listed. There are two outstanding features of current trans¬ 
port in insulating films. The J-i characteristics are frequently nonohmic, 
and, with the exception of tunneling, the conduction mechanisms are thermally 
activated. Band diagram and lattice potential models have provided useful ways 
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Table 10-2. Conduction Mechanisms in Insulators 


Experimentally 

Derivable 

Material 

Mechanism J-S Characteristics Constants 


1. Schottky 
Emission 


J s = A T 2 exp 


Q^b 

kT 


exp 


1 

AT 



2. Tunneling 

3. Space 
Charge 
Limited 

4. Ionic 
Conduction 

5. Intrinsic 
Conduction 

6. Poole-Frenkel 
Emission 


J T 


<? 2 <5- 2 

87t//$ b 


exp 


*^SCL 


9^, i 2 

8 d 


8i(2m) l/2 

3 hqS 




A - 

An 

Jp F 


a S '' 

Jf exp 


£j 

AT 


bT 3/2 vxp 


c $ exp - 


2kT 


kT 


exp 


i 

l 

AT 



( 10 - 21 ) 

( 10 - 22 ) 

(10-23) 

(10-24) 

(10-25) 

(10-26) 




*1 


E\ 


a, b y c = constant. 

c", = insulator dielectric constant. 




Figure 10-7. Barrier limited conduction mechanisms, (a) Schottky emission; (b) 
tunneling. 
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Figure 10-8. Bulk-limited conduction mechanisms. (Dotted lines refer to S - 0) (a) 
space-charge-limited; (b) ionic conduction of cations © ; (c) Poole-Frenkel. 

to picture the transport processes. These are depicted in Fig. 10-7 for 
barrier-limited and in Fig. 10-8 for bulk-limited transport mechanisms. 


10.3.2. Specific Electrical Transport Mechanisms 

It is worthwhile to comment briefly on each mechanism separately. 

/. Schottky Emission . The resemblance of this mechanism to thermionic 
emission of electrons from a heated metal into vacuum should not go unno¬ 
ticed. Thermionic emission is described by 

j te = AT 2 c \p - q<t> M /kT , (10-27) 

where <j> M is the work function and A is the Richardson constant equal to 120 
A/cm 2 -K 2 . Typically, q<i> M is equal to 4-5 eV. Rather than acquire this 
energy, electrons need only surmount the smaller energy barrier q<P B to access 
the empty conduction-band states of the insulator. Values of depend on 
the particular metal-insulator or metal-semiconductor contact in question as 
shown in Fig. 10-9. 

The origin of the barrier lowering or Schottky effect in high electric Fields is 
due to the continuously varying rather than abrupt electron potential energy 
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METAL WORK FUNCTION q<b M (eV) 

Figure 10-9. Barrier height energies of various metals on Si0 2 , GaAs, and Si vs. 
metal work function. (From Refs. 10, 16, and 17). 

distribution at the metal-insulator junction. When the electron enters the 
insulator, charge is redistributed on the electrode to maintain equipotential 
surfaces. The result is to produce a so-called image field that adds to the 
applied field and helps to round and lower the potential barrier (see Fig 10-7a). 

2. Tunneling . Tunneling involves charge transport through an insulating 
medium separating two conductors that are extremely closely spaced. The 
phenomenon need not involve planar electrodes. Electrical conduction in 
discontinuous metal films occurs by electron tunneling between islands when 
the interparticle spacing is small enough. At sufficiently high values of £> 
electrons penetrate or burrow horizontally (at constant energy) into the vacant 
insulator conduction-band states rather than surmount $ B vertically as in 
Schottky emission. Tunneling cannot occur classically but is rather a quantum 
mechanical effect. Because the metal wave functions extend into the insulator, 
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there is a probability that electrons can exist there. If the insulator is thin 

o 

enough, say - 30 A, electron wave functions can even extend into the 
opposing metal. The phenomenon of tunneling is important in other applica¬ 
tions discussed in this book (e.g., in the scanning tunneling microscope p. 311, 
in superconductors p. 483, and in tunneling transistors p. 672. In addition, 
both Schottky emission and tunneling mechanisms are operative at metal-semi¬ 
conductor contacts (Section 10.4.1). 

3. Space-Charge-Limited (SCL) Conduction . This bulk-limited mecha¬ 
nism occurs because the rate of carrier injection from the contact exceeds the 
rate at which charge can be transported through the film. A space-charge cloud 
develops that discourages further charge injection and leads to nonlinear 
conduction effects. At low levels of carrier injection Ohm's law is obeyed. If 
traps are present, the SCL current will be reduced because carriers are 
removed by empty traps. For a uniform distribution of trapping states with 
energies distributed throughout the band gap, a thermally activated current 
flows proportional to $ 2 /d ■ exp - oti / kT, where a is a constant. This type 
of conduction has been observed in thin polymer films. 

4 . Ionic Conduction . High-temperature conduction in thick-film and bulk 
insulators frequently occurs by ionic rather than electronic motion. Ions are 
large cumbersome carriers of low mobility. They require high activation 
energies (i.e., - 1-3 eV) to execute nearest-neighbor diffusive jumps, and 
when ions diffuse they also transport charge. Since ionic transport is basically 
solid-state electrolysis, atomic or molecular species will necessarily be dis¬ 
charged at the contacts. An important example of ionic conduction is the 
migration of sodium ions in thin Si0 2 films. This phenomenon leads to 
operating instabilities in field effect transistors. 

5. Intrinsic Conduction . This mechanism involves direct electronic excita¬ 
tion from the valence to conduction band. Because E g is large in insulators, 
intrinsic conduction is negligible at low temperatures. 

6 . Poole-Frenkel Emission . Bulk charge trapped by impurity levels in the 
insulator band gap can be transferred to the conduction band by internal 
emission processes. In Poole-Frenkel emission the impurity is viewed as an 
ionized donor with a hydrogen-like Coulomb potential. When filled by an 
electron, the defect center is neutral. In an applied field the potential well 
distorts asymmetrically, as shown in Fig. 10-8. Trapped electrons can escape 
by thermal activation over the reduced energy barrier corresponding to that 
required to ionize the donor. Net current flows due to successive electron 
jumps between Poole-Frenkel traps. Note the factor of 2 difference between 
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Poole-Frenkel and Schottky emission that arises due to consideration of a 
fixed rather than image charge. 

A variant of Poole-Frenkel emission involves quantum mechanical tunnel¬ 
ing into the insulator conduction band by electrons trapped within the spherical 
donor potential well. The J-$ characteristics resemble those of tunneling for 
this case. 

10.3.3. Comparison with Experiment 

The varied conduction mechanisms insulating films exhibit lead to a rich 
assortment of J-V characteristics. Identification of the dominant type of 

^{xIO 6 V/cm) 



Figure 10-10a. Current (I)-electric field (E) characteristics for Au-Si 3 N 4 -Si 
(yV-0.0005 fl cm) structure at 298 K. Si 3 N 4 film thickness is 5QoA and contact area is 
1.6 x 10" 4 cm 2 . 
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1000/T(K) 

Figure 10-1 Ob. Arrhenius plots of current density versus \/T for Si 3 N 4 , A1 2 0 3 , 
and Si0 2 films at the indicated electric fields (Ref. 1). 

transport generally necessitates measurement over a broad range of experimen¬ 
tal variables, e.g., S\ T and film thickness. Even so, it is not always easy to 
unambiguously specify the operative conduction mechanism. For example, 
temperature effects are generally dominated by the Boltzmann factor, which 
frequently masks the pre-exponential temperature dependence. Thus, 
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In J s j T 2 , In J X T, In J ln / r 3/2 , and In J P¥ each plotted versus 1 / T yields an 
Arrhenius curve of similar appearance. Schottky barrier lowering effects and 
Poole-Frenkel conduction are difficult to distinguish. Probable admixtures of 
conduction mechanisms operating in series or parallel further complicate the 
issue. 

Despite these impediments, careful measurements in specific systems have 
yielded characteristics in agreement with theory. For example, consider the 
data in Fig. 10-10 for Si 3 N 4 films sandwiched between Au and highly doped 
Si electrodes. The high field dc response (Fig. 10-10a) is consistent with 
Poole-Frenkel conduction while at low values of i ohmic (intrinsic) behavior 
(dotted in) is observed. Polarity reversal results in a different response, an 
effect attributable to the difference in <f> s at Au/Si 3 N 4 and Si/Si 3 N 4 inter¬ 
faces. The spectrum of conduction regimes can also be distinguished on the 
basis of temperature as shown in Fig. 10-10b. As the temperature is lowered, 
Poole-Frenkel conduction gives way to ohmic behavior and then to tunneling. 
Simultaneously, the effective activation energy for conduction declines in 
stages until it reaches zero, a certain sign that tunneling dominates. Except for 
the extent of conduction, A1 2 0 3 and Si0 2 films display similar characteristics 
as a function of temperature. Conduction behavior in insulators is often 
investigated as a function of film thickness. It is left to the reader to suggest 
how resultant current-voltage characteristics can be used to identify specific 
conduction mechanisms in this case. 


10 . 4 . Semiconductor Contacts and MOS Structures 


10.4.1. The Metal - Semiconductor Contact 

Good electrical contacts to semiconductors are crucial for the proper function¬ 
ing of circuits because signals enter and leave devices through them. One 
requirement for contacts is chemical stability, a subject already treated in 
Section 8.5.2 in connection with Al-Si. Further, it is essential that the contact 
does not introduce unwanted electrical characteristics such as rectification of 
signals or high resistance into the circuit. Band diagrams provide a convenient 
way to model the electrical response of contacts. As an example, consider the 
metal-7V-type semiconductor contact of Fig. 10-11 where the respective work 
function and Fermi level magnitudes are given by <t> s < <t> M and > E^. 
When both isolated materials are brought into contact, equilibrium of E F 
requires transfer of electrons to the metal. But this depletes the semiconductor 
of electrons in the vicinity of the interface. As a result the semiconductor 
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Figure 10-11. (a) Band diagrams for isolated metal and TV-type semiconductor, (b) 

Schottky barrier formation at metal-semiconductor contact. 


acquires less of an TV-type character, and this is why the valence and 
conduction-band edges bend up at the interface. Beyond — 1 pm or so, 
depending on doping level, the original TV-type semiconductor bands are 
unaltered. 

An important consequence of forming the contact is that a potential energy 
barrier of magnitude q{4> M - <t> s ) is built in at the junction. Readers familiar 
with elementary semiconductor diode behavior will recognize that the contact 
in question will function like a P-N junction that also has a built-in potential. 
Furthermore, for low semiconductor doping levels the dominant electron 
transport mechanism is Schottky emission. The barrier that electrons must 
surmount by thermal excitation is $ B = <j> M - where x is the electron 
affinity. A synthesis of the above characteristics suggests that the contact—alias 
Schottky diode—preserves and integrates both attributes. Under the application 
of bias voltage V 9 the well-known junction diode equation describing the 
current ( J ) response is operative 

J = ./ 0 (exp qV/kT — l), (10-28) 

For this case it is appropriate to let J 0 = AT 2 exp - q$ B /kT. 

Ohmic contacts ideally have linear J-S characteristics and pass current 
equally with either voltage polarity. They are required to have negligible 
resistance and exhibit small voltage drops in comparison to bulk semiconductor 
behavior. A convenient measure of the contact quality is the specific contact 
resistance R c (fl-cm 2 ). It is defined through Ohm’s law by 

R c =(dJ/dV)~' m0 (10-29) 
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N d (cm -3 ) 



Figure 10-12. Calculated and measured values of contact resistance as a function of 
doping level N D . Upper insert shows tunneling. Lower insert shows Schottky emis¬ 
sion. (Reprinted with permission from John Wiley and Sons, from S. M. Sze, 
Semiconductor Devices: Physics and Technology , Copyright © 1985, John Wiley 
and Sons). 

This equation is valid for low semiconductor doping levels. When, however, 
the doping concentration increases, the width of the depletion region shrinks. 
Correspondingly, the probability of electron tunneling from the contact into the 
semiconductor increases, reducing R c in the process. The dependence of R c 
on doping level is shown in Fig. 10-12. A target value of R c ~ 10 -7 Q-cm 2 is 
desired for device applications. Doping semiconductors heavily to levels of 
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10 20 cm' 3 or more is the only practical way to achieve this. The alternative of 
using metals with low $ B values is not feasible. In fact, for a large number 
of otherwise suitable metal and silicide films on TV-type Si ranges from 0.5 to 
0.9 V (Figs. 10-9 and 10-12). For P-type Si, $ B is somewhat lower (0.4 to 
0.6V), but still too high to yield low resistance contacts in the absence of heavy 
doping. 

10.4.2. MOS Structures 

By interposing a thin oxide film between planar metal and semiconductor 
electrodes, we create the technologically important MOS structure. This 
structure is an essential building block of silicon field effect transistors. Many 
excellent texts on semiconductor device physics (Refs. 17, 18) offer a detailed 
account of the electrical behavior of devices containing this structure. Because 
MOS capacitor properties have proven to be valuable in diagnosing the quality 
of the oxide and gate electrode films employed, it is worthwhile to qualitatively 
understand how this structure responds to applied voltages. Let us first 
consider an “ideal’ 7 MOS structure shown in Fig. 10-13. It is ideal because 
both the P-type semiconductor and metal gate have identical work functions. 
Further it is assumed there is no charge at the semiconductor-oxide (S-O) 
interface or within the oxide film proper. This means that with no voltage 
applied Ferini level equilibration leaves all bands horizontal or flat. When a 
negative potential V is applied to the metal, the energy of all of its conduction 
electrons is raised uniformly by an amount qV relative to the semiconductor. 
Holes are attracted to the S-O interface making the semiconductor appear to 
be even more P-type than originally. This causes the band bending at the 
interface shown in Fig. 10-13b, which graphically describes a semiconductor 
state known as accumulation. The high interfacial hole density serves to make 
the semiconductor an effective capacitor plate. For high V values the MOS 
capacitance is solely due to the oxide, and it reaches a maximum value given 
by C 0 = E 0 A 0 /d Qy where e 0 , d 0> and A 0 are the oxide dielectric constant, 
thickness, and area, respectively. 

As the polarity of V is reversed the overall capacitance decreases. The 
reason is that holes are repelled from the S-O interface. In this depletion 
regime the semiconductor acts like a dielectric with a capacitance governed by 
the width of the depletion region (Fig. 10-13c). There are now two capacitors 
in series with a lower overall capacitance than that of either semiconductor or 
oxide alone. (Recall that the overall capacitance is the product of the two 
individual (oxide and semiconductor) capacitances divided by their sum.) Note 
that now the bands bend the other way—the slope of the oxide conduction-band 
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(c) (d) 

ACCUMULATION 



BIAS VOLTAGE 

(e) 

Figure 10-13. Band diagrams for ideal metal oxide P-type Si structure under 
different voltage bias conditions: (a) V = 0; (b) V < 0, accumulation; (c) V > 0, 
depletion; (d) V > 0, inversion; (e) C-V characteristics. 

edge (or electric field) reverses sign, and near the interface the semiconductor 
is, interestingly, acquiring an TV-like character. At still higher positive voltages 
electrons are drawn to the interfacial region where the semiconductor is said to 
be locally inverted. Extreme band bending is characteristic of the inversion 
regime (Fig. 10-13d). None of the material properties, i.e., <i > M , <j> s , or 
E g , are altered by the application of voltage. With inversion, the overall 
capacitance reaches a minimum because the width of the semiconductor region, 
depleted of holes, is maximum; the oxide capacitance of course remains 
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constant throughout. The ideal capacitance-voltage (C-K) characteristics are 
shown in Fig. 10-13e for a high-frequency test signal. In this case all 
incremental charge appears at the edge of the depletion region. At low 
frequencies, however, the minority carriers (electrons) can follow the ac gate 
voltage variation and exchange charge with the inversion layer. The capaci¬ 
tance rises since only C 0 is now involved. Similar but opposite polarity 
considerations apply to N -type MOS structures. 

In silicon-based MOS structures the gate oxide is amorphous Si0 2 grown by 
thermal oxidation in dry 0 2 . Oxides as thin as 100 A are currently used in 
devices. Polysilicon, metal silicides, and combinations known as polycides are 
increasingly employed today in gate electrode structures supplanting the A1 
films of yesteryear. Real MOS structures are nonideal; now there are work 
function differences between the gate and Si, and both mobile oxide and fixed 
interfacial surface charge may exist. The net effect of these contributions is to 
cause initial band bending even with no applied bias. Also, the ideal C-V 
characteristics are shifted along the V axis by an amount equal to the flat-band 
voltage K FB given by (Ref. 17) 



In this formula Q ss is the S-0 interfacial charge density and p(*) is the spatial 
density of oxide charge with x , the distance into the oxide, measured from the 
metal interface. 


10.4.3. Dielectric Breakdown 

Application of very high electric fields across insulating films can sometimes 
lead to local destruction of the material, a phenomenon known as dielectric 
breakdown. This effect has long been studied in bulk as well as thin-film 
insulators and occurs at fields considerably in excess of those causing the 
ohmic or field-dependent, nonohmic conduction considered previously. Break¬ 
down appears to be the result of local injection of excess charge from an 
electrode which then triggers a lightning-like current discharge. In its wake are 
breakdown channels, holes, or melted surface craters, and sometimes even 
metal whiskers or dendrites that bridge and short the electrodes. Studies of 
dielectric breakdown phenomena in films have been performed on a large 
number of oxides (e.g., Si0 2 , A1 2 0 3 , Ta 2 0 5 , Zr0 2 , Hf0 2 , etc.) as well as 
other insulators, such as Si 3 N 4 , diamond, mylar, polymers etc. Typical 
breakdown fields range from 10 5 to 10 7 V/cm so that relatively low applied 
voltages (~ 1-100 V) can damage thin (~ 1000 A) insulators. It is for this 
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reason that technological interest has centered on insulating films employed in 
microelectronics, notably the gate oxide, where dielectric breakdown is a 
serious reliability concern. The remainder of this section will therefore be 
devoted to Si0 2 films (Ref. 19). 

It is generally agreed that electron impact ionization is responsible for 
intrinsic breakdown in Si0 2 films. In this process, electrons colliding with 
lattice atoms break valence bonds, creating electron-hole pairs. These new 
electrons accelerate in the field and through repeated impacts generate more 
electrons. Ultimately a current avalanche develops that rapidly and uncontrol¬ 
lably leads to excessive local heating and dielectric failure. Typical of theoreti¬ 
cal modeling (Refs. 19, 20) of breakdown is the consideration of three 
interdependent issues; 

1. Electrode charge injection into the insulator, e.g., by tunneling (Eq. 10-22). 
This formula connects current and applied field. 

2. The local electric field that is controlled by the relatively immobile hole 
density through the Poisson equation. 

3. A time-dependent change in hole density that increases with extent of 
impact ionization, but decreases with amount of hole recombination or drift 
away. The resulting rate equation depends on both current and field. 

Simultaneous satisfaction of these coupled relationships leads to the predic¬ 
tion that the current-voltage characteristics display negative resistance. This 
appears as a knee in the response above a critical applied voltage and reflects 
a current runaway instability. Another prediction is that the average break¬ 
down field rises sharply as the film thickness decreases. 

Reliability concerns for thin Si0 2 films in MOS transistors have fostered 
much statistical analysis of life-testing results and some typical experimental 
findings include: 

1. The histogram of the number of breakdown failures due to intrinsic causes 
peaks sharply at about 1.1 X 10 7 V/cm (Fig. 10-14a). The failure probabil¬ 
ity is nil below 7 X 10 6 V/cm. 

2. Time to failure (TTF) accelerated testing has revealed that 

0.33 eV 

TTF oc exp-exp - 2.47 V, (10-32) 

kT 

where the exponentials represent individual temperature and voltage accel¬ 
eration factors (Ref. 21). 

3. Contrary to theory, thinner oxides present a greater failure risk. The 
lifetime dependence on oxide film thickness is shown in Fig. 10-14b. Note 
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Figure 10-14. (a) Histogram of number of failures versus applied electric field in 

thin Si0 2 films. (From D. R. Wolters and J. J. van der Schoot, Philips J. Res. 40, 
115, 1985). (b) Time-dependent dielectric breakdown in Si0 2 . (Courtesy of A. M-R 
Lin, AT&T Bell Laboratories.) 
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that a 100-A difference in film thickness signifies a four-order-of-magnitude 
change in failure time. 

4. Film defects cause breakdown to occur at smaller than intrinsic fields and in 
correspondingly shorter times. 

As an example illustrating the use of Eq. 10-32 assume that the lifetime 
of Si0 2 films is 100 h during accelerated testing at 125 °C and 9 V. 
What lifetime can be expected at 25 °C and 8 V? Clearly, TTF = 
100exp0.33/ k{\/T 2 - l/r,)exp - 2.47(K 2 - K,), where T 2 = 298, 7j = 
398, K 2 = 8, V x = 9, and k = 8.63 X 10“ 5 eV/K. Substitution yields a 
value for TTF - 29,700 h. 


10 . 5 . Superconductivity in Thin Films 


10.5.1. Overview 

The discovery in 1986-1987 that superconductivity is exhibited by oxide 
materials at temperatures above the boiling point of liquid nitrogen ignited 
intense worldwide research devoted to understanding and exploiting the phe¬ 
nomenon. For a perspective of superconducting effects in these new materials 
and prospects for thin-film uses, it is worthwhile to view the subject against the 
75-year backdrop of prior activity. This pre-1988 “classical” experience with 
superconductivity will therefore be surveyed first in this chapter (Ref. 22); in 
Chapter 14 high-temperature superconductivity will be discussed. 

Superconductivity was discovered by Kamerlingh Onnes, who, in 1911, 
found that the electrical resistance of Hg vanished below 4.15 K. Actually it 
was estimated from the time decay of (nearly) persistent supercurrents in a 
toroid that the resistivity of the superconducting state does not exceed - 10 20 
12-cm, some 14 orders of magnitude below that for Cu. Some basic attributes 
possessed by superconductors have been experimentally verified and theoreti¬ 
cally addressed over the years. These are briefly enumerated here. 

/. Occurrence of Superconductivity . The phenomenon of superconductiv¬ 
ity has been observed to occur in at least 26 elements and in hundreds and 
perhaps thousands of metallic alloys and compounds. It is favored by a large 
atomic volume or lattice parameter, and when there are between two and eight 
valence electrons per atom. 

2. Critical Temperature and Magnetic Field . The superconducting state 
only exists in a specific range of temperature ( T ) and magnetic field strength 




10.5. Superconductivity in Thin Films 


481 


Table 10-3. Values of T c and H s for Superconducting Materials 


Element 

T c (K) 

tf,(Oe) 

Alloy 

or Compound 

T c (K) 

H s (Oe)** 

A1 

1.19 

98.8 

V 3 Ga 

14.8 

25 x 10 4 

In 

3.41 

285 

V 3 Si 

16.9 

24 x 10 4 

La(0) 

5.9 

1000 

Nb 3 Sn 

18.3 

28 x 10 4 

Nb 

9.2 

2,000*. 3,000** 

Nb 3 Ga 

20.2 

34 X 10 4 

Pb 

7.18 

800 

Nb 3 Ge 

22.5 

38 X 10 4 

Re 

1.70 

200 

PbMo 5 S 8 

14.4 

60 x 10 4 

Sn 

3.72 

308 

NbN 

15.7 

15 x 10 4 

Ta 

4.48 

825 

YBa 2 Cu 3 0 7 

93 

— 

Tc 

8.22 

— 

BiSrCaCuO 

107 

— 

Th 

1.37 

161 

TIBaCaCuO 

120 

— 

T1 

2.39 

170 




V 

5.13 

1290*,7000** 





**//,(«) 

From Refs. 22 and 23. 


(//). Critical values of these quantities are experimentally found to be closely 
described by 

H s = H 0 {l-(T/T c ) 2 ), (10-33) 

where H s is the critical field, H 0 is the maximum field at T = 0 K, and T c is 
the highest temperature at which superconductivity is observed. On an H vs. 
T plot the division between superconducting and normal conduction regimes is 
defined by Eq. 10-33. A temperature spread of only - 10 -3 K (10 -4 K in 
pure metals, 10“ 2 K in alloys) about the value of T c characterizes the 
sharpness of the transition between the two states. Superconductivity can be 
extinguished by exposure to a field greater than H s or by passing a supercur¬ 
rent that induces a magnetic field in excess of H s . Values of T c and H s are 
listed in Table 10-3 for a number of superconducting materials. 

3. Meissner Effect . One of the remarkable features of the superconducting 
state is the Meissner effect. It is characterized by the exclusion of magnetic 
flux and, hence, electrical currents from the bulk of the superconductor. The 
exclusion is not total, however, and both flux and current are confined to a 
surface layer known as the penetration depth A s . The London theory of 
superconductivity indicates that 

\(T) = x,(o)[i - (77 t;) 4 ] 


( 10 - 34 ) 
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where A v (0) is the penetration depth at 0 K. Typically, \ s is 500-1000 A. The 
Meissner effect means that if a superconductor is approached by an H field, 
screening currents are set up on its surface. This screening current establishes 
an equal and opposite H field so that the net field vanishes in the supercon¬ 
ductor interior. The now common displays of permanent magnets levitated 
over chilled high-T c superconductors is visual evidence of the Meissner effect. 

4. Type / and Type // Superconductors . There are two types of supercon¬ 
ductors: type I (or soft) and type II (or hard). With the exception of Nb and V 
the elements are type I superconductors. In such materials the superconducting 
transition is abrupt, and flux penetrates only for fields larger than H s . In type 
II superconductors (exemplified by Nb, V, alloys (e.g., Mo-Re, Nb-Ti), and 
A-15 compounds (e.g., Nb^Sn, Nb 3 Ge)), there are two critical fields H S {1) 
and H s (u ), the lower and upper values. If the applied field is below H s (l ), 
type II behavior is the same as that displayed by type I superconductors. For 
fields above H s (l) but below H s (u ), there is a mixed superconducting state, 
whereas for H > H s {u ), normal conductivity is observed. Importantly, type II 
superconductors can survive in the mixed state up to extremely high H values 
(e.g., in excess of 10 5 gauss). This property has earmarked their use in 
commercial superconducting magnets. In the mixed state, just above H S {1 ), 
flux starts to penetrate the material in microscopic tubular filaments (~ 1000 

o 

A in diameter), known as fluxoids or vortices, that lie parallel to the field 
direction. The core of the fluxoid is normal while the sheath is superconduct¬ 
ing; the circulating supercurrent of the latter establishes the field that keeps the 
core normal. Fluxoids, which are usually arranged in a lattice array, grow in 
size as the field is increased with progressively more flux penetration. Above 
H s (u) y flux penetrates everywhere. The current flow is not entirely lossless in 
the mixed state, however, because a small amount of power is dissipated by 
viscous fluxoid motion. Fluxoid pinning due to introduction of alloying ele¬ 
ments or defects is a practical way to minimize this energy loss. 

5. The BCS Theory . The theory by Bardeen, Cooper, and Schrieffer (BCS) 
(Ref. 24) in 1957 provided the basis for understanding superconductivity at a 
microscopic level, superceding previous phenomenological approaches. Cen¬ 
tral to the BCS theory is the complex coupling between a pair of electrons of 
opposite spin and momentum through an interaction with lattice phonons. The 
electrons that normally repel each other develop a mutual attraction, forming 
Cooper pairs. A measure of the average maximum length at which the phonon 
coupled attraction can occur is known as the coherence length £. Schrieffer 
described the theoretical issue as “how to choreograph a dance for more than a 
million, million, million couples'’ so that they condense into a single state that 
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moves in step or flows like a frictionless fluid (Ref. 24). Since the electron 
coupling is weak, the energy difference between normal and superconducting 
states is small with the latter lying a distance 2A below the former. Thus, a 
forbidden energy gap of width 

2A = 3.5*7; (10-35) 

appears in the density of states centered about the Fermi energy at 0 K. This 
predicted relationship has been verified in many superconductors by tunneling 
measurements, which are described in the next section. 

When the temperature is raised, the amplitude and frequency of lattice 
atomic motion increase, interfering with the propagation of phonons between 
correlated Cooper pairs. The attraction between electrons is diminished and 2A 
decreases. At T = T c , A = 0. Any perturbation in structure or composition 
extending over the coherence length can alter T c or 2A, placing a practical 
limit on useful superconducting behavior. 


10.5.2. Superconductivity in Thin Films; Tunneling 


Thin films have traditionally played a critical role in testing theories of 
superconductivity and in establishing new effects. Superconductivity appar- 

o 

ently persists to film thicknesses of — 10 A. Lower limits are difficult to 
establish because films of such thickness are generally discontinuous. The 
dependence of T c on deposition conditions and film thickness has been studied 
for a long time, and interesting, though not easily predictable or explainable, 
effects have been reported. When either \ s or £ becomes comparable to the 
thin film thickness, deviation from bulk superconducting properties may be 
expected. For example, enhanced superconductivity has been reported in 
vapor-quenched, amorphous Bi and Be films where T c values of 6 and 8 K 
were obtained, even though these metals are not superconducting in bulk. 
Higher T c values with decreasing film thickness have been observed by 
several investigators. The size of these effects ranges from fractions to several 
degrees K and depends on the magnitude and sign of the film stress, impuri¬ 
ties, lattice imperfections, and grain size in generally inexplicable ways. A link 
between T c and the fundamental nature of the material is suggested by the BCS 
formula 


1.14 hv 

T c = - 7 - eX P - 


N(E f )U 


(10-36) 


(for N(E f )U < 1). The quantity N(E F ) is the density of states at the Fermi 
level, U is the magnitude of the attractive electron-lattice interaction, and v 
corresponds to the lattice (Debye) frequency. Normally N(E F )U is weakly 
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sensitive to lattice dimensions and has a value between 0.1 to 0.5. Furthermore 
if N(E f ), U , or v increase, so does T c . However, connections between these 
quantities, on the one hand, and film composition and structure, on the other, 
are uncertain at best. 

The most extensive experimentation in thin films has involved tunneling 
phenomena. Unlike the tunneling between normal metals considered earlier 
(Section 10.3.1), a superconducting tunnel junction consists of two metal 
films, one or both being a superconductor, separated by an ultrathin oxide or 
insulator film. Tunneling currents generally flow when electrons emerge from 
one metal to occupy allowable empty electron states of the same energy in the 
opposite metal. Through application of voltage bias, relative shifts of the entire 
electron distribution of both metals occur, either permitting or disallowing 
tunneling transitions. Thus, if electrons at the Fermi level of a normal metal lie 
opposite the forbidden energy gap at the Fermi level of the superconductor, no 
tunnel current flows. Translation of band states by a voltage A /q, or half the 
energy gap, causes occupied energy levels of the former to line up with 
unoccupied levels of the latter resulting in current flow. If both electrodes are 
the same superconductor, a voltage corresponding to the whole energy gap 
must be applied before tunnel current flows. Current-voltage characteristics 
corresponding to these two cases are shown in Fig. 10-15. A more complicated 
behavior is exhibited when two different superconductors with energy gaps 




Figure 10-15. Current-voltage characteristics of tunnel junctions: tunnel junctions 
(a) one metal normal—one metal superconducting, (b) both metals identical supercon¬ 
ductors. (c) both metals superconducting but with different energy gaps, (d) Josephson 
tunneling branch (1) and normal superconducting tunneling branch (2). 7 0 is the critical 
junction current density. 
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2A, and 2A 2 are paired. By yielding precise values of 2A, such measurements 
have provided direct experimental verification of the BCS theory. 

One of the very important advances in superconductivity was the remarkable 
discovery by Josephson (Ref. 25) that supercurrents can tunnel through a 
junction. Thus tunneling of Cooper pairs and not only electrons is possible. 

D 

Two superconducting electrodes sandwiching an ultrathin insulator — 50 A 
thick are required. The current-voltage characteristic has two branches (Fig. 
10-15d). The normal tunneling branch is similar to Fig. 10-15c but with a 
reduced negative resistance feature. The Josephson tunneling current branch 
consists of a current spike; no voltage develops across the superconducting 
junction in this case. Because the Josephson current is extremely sensitive to H 
fields, the junction can be easily switched from one branch to the other. 

Josephson devices known as SQUIDs (superconducting quantum interfer¬ 
ence devices) capitalize on these effects to detect very small H fields or to 
switch currents at ultrahigh speed in computer logic circuits. These applica¬ 
tions will be described in more detail in Section 14.8.3. 


10 . 6 . Introduction to Ferromagnetism 


The remainder of this chapter is devoted to some of the ferromagnetic 
properties of thin films (Refs. 26, 27). We start with the idea that magnetic 
phenomena have quantum mechanical origins stemming from the quantized 
angular momentum of orbiting and spinning atomic electrons. These circulat¬ 
ing charges effectively establish the equivalent of microscopic bar magnets or 
magnetic moments. When neighboring moments due to spin spontaneously and 
cooperatively order in parallel alignment over macroscopic dimensions in a 
material to yield a large moment of magnetization (A/), then we speak of 
ferromagnetism. The quantity M is clearly a vector with a magnitude equal to 
the vector sum of magnetic moments per unit volume. In an external magnetic 
field (H) the interaction with M yields a field energy density (E H ) given by 

E n = -H M. (10-37) 

However, no external field need be applied to induce the ferromagnetic state. 
The phenomenon of ferromagnetism has a number of characteristics and 
properties worth noting at the outset. 

1. Elements (e.g., Fe, Ni, Co), alloys (e.g., Fe-Ni, Co-Ni), oxide insula¬ 
tors (e.g., nickel-zinc ferrite, strontium ferrite) and ionic compounds (e.g., 
CrBr 3 , EuS, Eul 2 ) all exhibit ferromagnetism. Not only are all crystal 
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structures and bonding mechanisms represented, but amorphous ferromagnets 
have also been synthesized (e.g., melt-quenched Fe 80 B 20 ribbons and vapor- 
deposited Co-Gd films). 

2. Quantum mechanical exchange interactions cause the parallel spin align¬ 
ments that result in ferromagnetism. It requires an increase in system energy to 
disorient spin pairings and cause deviations from the parallel alignment direc¬ 
tion. This energy, known as the exchange energy (E ex ), is given by 

E ex =A x (V<t,) 2 (10-38) 

and is a measure of the “stiffness" of M or how strongly neighboring spins 
are coupled. The exchange constant A x is a property of the material and equal 
to - 10 -6 ergs/cm in Ni-Fe. Avoidance of sharp gradients in <t> , the angle 
between M and the easy axis of magnetization, leads to small values of E ex . 

3. Absorbed thermal energy serves to randomize the orientation of the spin 

moments (i s . At the Curie temperature (T c ) the collective alignment collapses, 
and the ferromagnetism is destroyed. By equating the thermal energy absorbed 
to the internal field energy (/^// y ), i.e., kT c = values of H i can be 

estimated. The internal field //, permeating the matrix is established by 
exchange interactions. Typically, H i is predicted to be in excess of 10 6 Oe, an 
extremely high field. 

4. Magnetic anisotropy phenomena play a dominant role in determining the 
magnetic properties of ferromagnetic films. By anisotropy we mean the 
tendency of M to lie along certain directions in a material rather than be 
isotropically distributed. In single crystals, M prefers to lie in the so-called 
easy direction, say [100] in Fe and [111] in Ni. To turn M into other 
orientations, or harder directions, requires energy (i.e., magnetocrystalline 
anisotropy energy (E K )). Consider now a fine-grained poly crystalline ferro¬ 
magnetic film of Permalloy (~ 80 Ni-20 Fe). Surprisingly, it also exhibits 
anisotropy with M lying in the film plane. In such a case E K is a function of 
the orientation of M with respect to film coordinates. For uniaxial anisotropy 

E k = K u sm 2 6, (10-39) 

where K u is a constant with units of energy/volume, and 0 is the angle 
between the in-plane saturation magnetization and the easy axis. The source of 
the anisotropy is not due to crystallographic geometry but rather to the 
anisotropy arising from shape effects (i.e., shape anisotropy). 

When ferromagnetic bodies are magnetized, magnetic poles are created on 
the surface. These poles establish a demagnetizing field ( H d ) proportional and 
antiparallel to M, i.e., H d = - NM , where N is known as the demagnetizing 
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factor and depends on the shape of the body. For a thin Film, N ** 4x in the 
direction normal to the film plane. Therefore, H d = -4 ttM. In evaporated 
Permalloy films H d can be as large as - 10 4 Oe. However, in the film plane 
H d is much smaller so that M prefers to lie in this plane. There are other 
magnetic films of great technological importance—garnets for magnetic bubble 
devices (Section 10.8.4) and Co-Cr for perpendicular magnetic recording 
applications (Section 14.4.3)—where M is perpendicular to the film plane. 
Associated with H d is magnetostatic energy (E M ) of amount 

E M =(\/2)H d M (10-40) 

per unit volume. The 1/2 arises because self-energy is involved, i.e., H d is 
created from the distribution of M in the film. In the hard direction the 
energy density is therefore 

E m = 2tM 2 . (10-41) 

The origins of anisotropy are complex and apparently involve directional 
ordering of magnetic atom pairs, e.g., Fe-Fe. Film anisotropy is affected by 
film deposition method and variables, impingement angle of the incident vapor 
flux, applied magnetic fields during deposition, composition, internal stress. 


M 



Figure 10-16. Schematic hysteresis loops for soft and hard magnetic materials. For 
soft magnets H r < 0.05 Oe. For hard magnets H c > 300 Oe. (From Ref. 28). 
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and columnar grain morphology, in not readily understood ways. Even amor¬ 
phous ferromagnetic films exhibit magnetic anisotropy. 

5. Not all ferromagnetic materials are magnets or have the ability to attract 
other ferromagnetic objects. The reason is that the matrix decomposes into an 
array of domains, each of which has a constant M but is differently oriented. 
Hence, Y.M = 0 over macroscopic dimensions. Domains facilitate magnetic 
flux closure and reduce stray external magnetic fields—effects that minimize 


Table 10-4. Properties of Soft and Hard Magnetic Thin Filins 


Soft Magnetic 





Materials 

4-jt M s (kG) 

H c (Oe) 

H k (Oe) 

Application 

Permalloy 

10 

0.5 

5 

Computer memory. 
magnetoresistance 
detectors, 
recording heads 

CoZr (amorphous) 

14 

< 0.5 

2-5 

- 

Fe 80 B 2 o (amorphous) 

15 

0.04 

7 

- 

Fe 72 Si 28 (amorphous) 

12 

0.2 

4 

— 

Fe 72 Si| g C 10 (amorphous) 

16 

0.2 

7 

— 

Y,Fe 5 0 12 

1 

- 0 

1000 

Magnetic 
bubble memory 
devices 

Hard Magnetic 

Materials 





Co-Re 

6-9 

700 


Longitudinal 
magnetic recording 
media 

Co-Pt 

10-18 

1100-1800 


* 

Co-Ni 

10-15 

1000-1300 
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T'Fe 2 0 3 (Co) 
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2100 


" 
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magnetic 
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TbFe 


2000-10000 
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recording media 
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- 10 

1000-2000 


n 

GdTbFe 


1000-3000 


** 


^Values for M s and H c depend strongly on composition and method of deposition. H c (parallel) values 
are typically 0.5// r (perpendicular). 

Note: 1 Oe = 80 A/m; 1 G = 10“ 4 T; H k (anisotropy field) - 2 K u /M s 
From Refs. 28-30. 
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E m - In bulk materials, domains are frequently smaller than the grain size, 
whereas the reverse is true in films. 

The response of a ferromagnet to an externally applied magnetic field is its 
most important engineering characteristic. Initially, M increases with H and 
eventually levels off at the saturation magnetization value M s . The application 
of H causes favorably oriented domains to grow at the expense of unfavorably 
oriented ones by domain boundary migration. At high enough fields, M even 
rotates. Further cycling of H in both positive and negative senses yields the 
well-known hysteresis loop. Depending on its size and shape ferromagnets are 
subdivided into two types—hard and soft, as indicated schematically in Fig. 
10-16. The distinction is based on the magnitude of the coercive force H c or 
field required to reduce M to zero. Hard magnetic materials with large H c are 
hard to magnetize and hard to demagnetize. That is why they make good 
permanent magnets and are used for magnetic recording media. Soft magnetic 
materials, on the other hand, both magnetize and switch magnetization direc¬ 
tions easily (small H c ). These are just the properties required for computer 
memory or recording head applications. In Table 10-4 the properties of both 
soft and hard magnetic thin-film materials are listed. How these ferromagnetic 
properties are manifested in applications is explored in the remainder of this 
chapter as well as in Chapter 14. 


10 - 7 , Magnetic Film Size Effects — M s vs. Thickness 
and Temperature 


10.7.1, Theory 

Magnetic property size effects are expected simply because the electron spin in 
an atom on the surface of a uniformly magnetized ferromagnetic film is less 
tightly constrained than spins on interior atoms. Fewer exchange coupling 
bonds on the surface than in the interior is the reason. Therefore, the question 
has been raised of how thin films can be and still retain ferromagnetic 
properties. At least four decades of both theoretical and experimental research 
have been conducted on the many aspects of this fundamental issue and related 
ones. The two theoretical approaches—spin wave and molecular field—both 
predict that a two-dimensional network of atoms of ferromagnetic elements 
should not be ferromagnetic but rather paramagnetic at absolute zero. At low 
temperatures a ferromagnet has very nearly its maximum magnetization. The 
deviation from complete saturation ( AM S ) is due to waves of reversed spin 
propagating through the material. By summing the spin waves according to the 
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T c 

(a) 



Figure 10-17. (a) Calculated temperature dependence of normalized saturation mag¬ 

netization for different numbers of film layers ( D ). (From Ref. 31). (b) Temperature 
dependence of relative bulk and surface magnetization in the ferromagnetic glass 
Fe 40 Ni 40 B 20 (From Ref. 33). 
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rules of quantum statistical mechanics, we obtain the magnitude of the devia¬ 
tion at any temperature (A M S (T)) relative to absolute zero M s (0). In bulk 
materials it is generally accepted that 

AM s (r)/M s (0) = BT 2/2 , (10-42) 

where B is the spin wave parameter. Its value at the surface has been 
calculated to be twice that in the bulk. In thin films, theoretical treatments of 
magnetic size effects are a subject of controversy. Early calculations show a 
relative decrease in M vs. T for films of varying thickness as indicated in Fig. 
10-17. For films thinner than four atomic layers M s varies linearly over a 
wide temperature range. 

The molecular field approach replaces the exchange interaction between 
neighboring spins around a particular atom by an effective molecular field. A 
statistical accounting of the number of interactions between an atom in the y'th 
layer of a film with other atoms in the same as well as j - 1 and j + 1 layers 
is the approach taken. Such calculations typically reveal that M begins to 
decrease below the value in bulk when the film thickness is less than some 
number of lattice spacings (e.g., 10), corresponding to a film thickness of 
perhaps 30 A. 

In recent years, quantum calculations of ferromagnetic films have achieved a 
high level of sophistication (Ref. 32). Spin densities in the ground state of Fe 
and Ni films consisting of a few atomic layers have, interestingly, been 
predicted to lead to an enhancement of the magnetic moment per atom in the 
outermost layer (e.g., by 20% in (001)Ni and 34% in (001)Fe compared to the 
bulk value found four layers away). Such surprising results rule out the 
existence of magnetically “dead” layers reported in the literature. 

Importantly, none of the aforementioned theories explicitly takes into ac¬ 
count such surface effects as lattice relaxation or distortion, surface reconstruc¬ 
tion, and pseudomorphic growth at real surfaces and interfaces. Rather, 
perfectly planar surfaces are assumed. 


10.7.2. Experiment 

Many experimental methods have evolved to yield direct or indirect evidence 
of ferromagnetic order in thin films. They broadly fall into three categories, 
depending on whether the 

1. spin polarization of electrons, 

2. net magnetic moment of the sample, or 

3. internal magnetic (hyperfine) field 
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is measured. The first relies on extracting electrons from the conduction bands 
of ferromagnetic solids by photoelectron emission and analyzing their energies 
by methods similar to those employed in Auger spectroscopy. Assuming no 
spin flips occur during emission, the number of majority and minority spins 
relative to the direction of magnetization of the surface can be determined. For 
example, the net surface magnetization of Fe 40 Ni 40 B 20 , an amorphous ferro- 
magnet ( T c = 700 K), was measured by detecting elastically backscattered 
spin polarized electrons (Ref. 33). The results are depicted in Fig. 10-17b for 
90 eV electrons which are estimated to probe only the topmost one or two 
atomic layers (—2.5 A). Method 2 relies on very sensitive magnetometers to 
directly yield macroscopic M vs. H behavior. Although relatively free from 
interpretation problems associated with indirect methods 1 and 3, the measure¬ 
ments are not surface selective. In the third method the hyperfine magnetic 
field // eff , which is to a good approximation proportional to the local atomic 
moment, is measured. Nuclear physics techniques such as nuclear magnetic 
resonance and Mossbauer effect are used; only the latter will be discussed here 
at any length. 

The Mossbauer effect is based on the spectroscopy of specific low-energy 
nuclear 7 -rays that are emitted (without recoil) from excited radioactive atoms 
embedded in a source. These y-rays are absorbed by similar nonradioactive 
ground state atoms contained within an absorber matrix. In the most famous 
Mossbauer transition, 57 Co nuclei emit 14.4-keV y-rays and decay to the 57 Fe 
ground state. An absorber containing 57 Fe, an isotope present in natural Fe 
with an atomic abundance of 2 . 2 %, can absorb the y-ray if its nuclear levels 
are very precisely tuned to this exact energy. Otherwise there is no absorption, 
and the y-ray will simply pass through the absorber and be counted by a y-ray 
detector. This is usually the case because differences in the local electromag¬ 
netic environment of Fe 57 in both the source and absorber alter the 14.4-keV 
level slightly, destroying the resonance. Fortunately very small, easily pro¬ 
duced Doppler effect energy shifts, caused by relative source-absorber veloci¬ 
ties of only — ± 1 cm/sec, can increase or decrease the energy sufficiently to 
restore the resonance. Mossbauer spectra thus reveal relative energy differ¬ 
ences in y-ray transitions of 57 Fe as they are affected by atomic surroundings. 
In a ferromagnetic absorber the 57 Fe nucleus is immersed in the internal 
magnetic field that splits the nuclear levels, an effect that is the counterpart to 
Zeeman splitting of atomic electron levels. Six transitions can now be accessed 
by using an appropriate (unsplit) source. 

Mossbauer spectra of Fe film absorbers of varying thickness grown epitaxi¬ 
ally on Ag are shown in Fig. 10-18. The detection of y-ray-induced conversion 
electrons plus the use of enriched 57 Fe layers provide the necessary sensitivity 
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V [ mm/s ] 

Figure 10-18. Mossbauer spectra of ultrathin epitaxial (110) Fe films on (111) Ag at 
room temperature. (From Ref. 32 with permission from Elsevier Science Publishers). 


to probe ultrathin films. The six spectral lines characteristic of ferromagnetic 
behavior are clearly discernible at film thicknesses of 5 A. In the 620-A film 
the velocity or energy span between outer lines is essentially equivalent to the 
bulk // eff of 330 kOe. Peak broadening plus a decreased span in the very 
thinnest films signify a distribution of somewhat reduced field strengths. 


10 . 8 . Magnetic Thin Films for Memory Applications 


10.8.1. Introduction 

Interest in magnetic films arose primarily because of their potential as com¬ 
puter memory elements. Although semiconductor memory is firmly established 
today, a quarter of a century ago small bulk ferromagnetic ferrite cores were 
employed for this purpose. Even earlier it was discovered that magnetic films 
deposited in the presence of a magnetic field exhibited square hysteresis loops. 
This meant that magnetic films could be used as a bistable element capable of 
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switching from one state to another (e.g. from 0 to 1). Switching times were 
about 10 -9 sec, a factor of 100 shorter than that for ferrite cores. The promise 
of higher-speed computer memory and new devices fueled a huge research and 
development effort focused primarily on Permalloy films. Despite initial 
enthusiasm it was found that careful control of magnetic properties produced 
formidable difficulties. The metallurgical nightmare of film impurities, imper¬ 
fections, and stress was among the reasons that actual performance of these 
films fell short of originally anticipated standards (Ref. 34). 

In the mid-1960s an entirely new concept for computer memory and data 
storage applications was introduced by investigators at Bell Laboratories (Ref. 
35). It employed special magnetic thin films (e.g., garnets) possessing cylindri¬ 
cal domains known as bubbles. Unlike the switching of M in Permalloy films, 
information is processed through the generation, translation, and detection of 
these bubbles (Section 10.8.4). Domain behavior is critical to both approaches, 
and we therefore turn our attention to this subject now. 

10.8.2. Domains in Thin Films — M in Film Plane 

When Permalloy and other soft magnetic materials are vapor deposited in a 
magnetic field of - 100 Oe, M lies in the film plane and in the field 
direction. Uniaxial anisotropy develops such that a 180° rotation of M occurs 
across the boundary or wall separating adjacent domains. Schematic illustra¬ 
tions of two ways the rotation can be accommodated are shown in Fig. 10-19. 
In the Bloch wall, which is common in bulk ferromagnets, the spins undergo a 
rotation about an axis parallel to the hard direction. There are two types of 
Bloch walls: one in which the magnetization in the wall center points upward, 
and one in which it points down. Therefore on the film surface there are 
free-magnetic poles just above the wall region. These establish stray fields that 
increase the magnetostatic energy of the system. With decreasing film thick¬ 
ness increases since more free poles exist. In very thin films there is 
another type of wall with a much lower value of e m . It is shown in Fig. 
10-19b, and is known as the Neel wall. In Neel walls the direction of 
magnetization turns about an axis perpendicular to the film plane; there are no 
free poles in this case. 

In both types of domain walls E k is smallest when the change in M is 
abrupt—i.e., when the wall is as narrow as possible; but this serves to increase 
E cx , which is minimized when spin pairings remain tightly aligned—i.e., when 
the wall is as wide as possible. A compromise is struck when the total 
magnetic energy, E T —E K -1 - E cx + E m , is minimized with respect to number 
of wall spins. Typically, domain walls are 1000 A wide and have an effective 
surface energy of several ergs/cm 2 . 
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Figure 10-19. Schematic illustrations of magnetization directions at domain walls, 
(a) 180° Bloch wall; (b) 180° Neel wall; (c) cross-tie wall. (From Ref. 34). 



FILM THICKNESS- 

Figure 10-20. Calculated surface energy of a Bloch wall, a Neel wall, and a cross-tie 
wall as a function of film thickness. {A x ~ 10“ 6 erg/cm, M s = 800 gauss, K u = 1000 
ergs/cm 3 ). (From Ref. 27 with permission from McGraw-Hill, Inc.) 

The results of a calculation of the surface energy of Bloch, Neel, and 
cross-tie walls as a function of film thickness are shown in Fig. 10-20. 
Cross-tie walls are essentially variants of unipolar Neel walls and are shown in 
Fig. 10-19c. They consist of tapered Bloch wall lines jutting out in both 
directions from the Neel wall spine. This configuration promotes magnetic flux 
closure and possesses a lower overall energy than the simple Neel wall. The 
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Figure 10-21 . Lorentz micrograph of a cross-tie Neel wall transition in a 300-A-thick 
Permalloy film. (From Ref. 27 with permission from McGraw-Hill, Inc.) 


very thinnest films are predicted to contain Neel walls and films thicker than 
1000 A, Bloch walls. In between cross-tie walls are stable; they have been 
observed in Permalloy films within the predicted film thickness range, as 
shown in the Lorentz micrograph of Fig. 10-21. In this technique the film is 
mounted slightly above (or below) the focal plane of the objective lens in a 
transmission electron microscope. Electrons passing through the film are 
deflected owing to the Lorentz forces, and produce a kind of shadow image of 
the magnetization. The resolution of the technique is sufficient to detect 
magnetic ripple. The latter is a fine wrinkling substructure within domains 
where M undergoes slight periodic misalignments from the uniaxial direction. 
Ripple is apparently due to complex coupling effects between exchange and 
magnetostatic forces in neighboring crystallites and extends over tens to 
hundreds of angstroms. 


10.8.3. Single-Domain Behavior 

When films with uniaxial anisotropy are exposed to magnetic fields, the 
magnetization direction switches the way single domain particles do. Because 
of the importance of switching phenomena in memory applications, it is of 
interest to consider a simple model for this behavior (Ref. 36). A central 
assumption is that Eq. 10-39 holds. Equilibrium states of minimum energy 
exist for 0 = 0, the easy magnetization direction, as well as for 0 = it. But for 
0 - tt/2, 37t/2, the hard directions, there are energy maxima. The total 
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energy of the film in an applied magnetic field with components H x and H y in 
the easy and hard directions is 

E r = -M s H x cos 0 - Af s H y s\n 6 + K u sin 2 0. (1(M3) 

In stable equilibrium the magnetization angle is determined by the condi¬ 
tions that 

dE T /30 = 0, (10-44) 

which amounts to a vanishing torque, and 

d 2 E T /36 2 > 0. (10-45) 

If d 2 E T /dO 2 < 0, the equilibrium is unstable ; transitions from unstable to 
stable states occur at critical fields for which 3 2 E T /b9 2 = 0. Successive 
differentiations yield the conditions for stable equilibrium, respectively, 

dE r /dQ = M s H x s\n 0 - M s H y cos 0 + 2K u sin 0cos 0 = 0, (10-46a) 


d 2 E T /d9 2 = M S H X cos 9 + M s H y sin 0 + 2K u (cos 2 9 - sin 2 0) > 0. (10-46b) 

We are now in a position to calculate hysteresis loops. The two simplest 
cases occur when the fields are applied in the easy and hard directions. 

1. Easy direction (H y = 0). From Eq. 10-46 the two solutions are 

M S H X = -2K u cosO\ sin0 = O. 

For the first solution, d 2 E T /60 2 = -2K u sin 2 0 is always negative, (except 
for 9 = 0 and 9 — 7r), and hence the equilibrium is not stable. However, the 
second solution yields 9 = 0, tt\ it is easily shown that 9 = 0 is stable for 
H x > -2K lt /M s , and 9 = 7 r is stable when H x < 2K U /M S . These values of 
are defined as the anisotropy field H k . The magnetization in the easy 
direction is M x = M s cos 0, so that M x = ±M S . In Fig. 10-22a the resulting 
square-loop hysteresis curve is drawn, where H k is equivalent to H c . 

2, Hard direction (H x = 0). In this case M y reaches +M 5 when H y 
exceeds 2K U /M S > and has the value -M s when H y < -2K U /M S . For 
applied H y fields between these values, M y varies linearly; i.e., M y = 
M 2 H y /2K U . The multivalued character of the loop vanishes as shown in the 
hysteresis curve of Fig. 10-22b. 

Loops observed experimentally in films differ from the calculated ones. 
Whereas square loops in the easy direction have been measured, coercive 
fields are usually much smaller than 2 K u /M s . The reason is that magnetiza- 
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Figure 10-22. Theoretical hysteresis loops: (a) In the easy direction; (b) in the hard 
direction. Experimental hysteresis loops (Permalloy, 300 A thick); (c) in the easy 
direction; (d) in the hard direction. (From Ref. 27 with permission from McGraw-Hill, 
Inc.). 


tion reversal does not occur by uniform rotation, but by domain translation and 
rotation. 


10.8.4. M Perpendicular to Film Plane 

Imagine a bar magnet compressed axially to thin film dimensions. It would 
have numerous north poles on one surface opposed by the same number of 
south poles on the other surface giving rise to a large value of E M . Unlike 
films that develop an in-plane magnetization in such a case, there are materials 
where M points normal to the film surface (Ref. 37). Examples are single¬ 
crystal films of magnetoplumbite (Pb0-6Fe 3 0 4 ), ortho -ferrites (RE Fe0 3 , 
with RE a rare earth element), and, most importantly, magnetic garnets (e.g., 
Y 3 Fe 5 0 J2 ). Competition between E M and E K results in the formation of 
domains essentially possessing a uniaxial perpendicular anisotropy. The 
domain structure in such films is striped, displaying the fingerprint pattern of 
Fig. 10-23. Dark and light domains have oppositely pointed magnetization 
vectors. By viewing these transparent films through crossed polarizers, one 
notes an optical contrast between oppositely magnetized domains. The Faraday 
effect, a magneto-optical phenomenon, is responsible. It causes the plane of 
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Figure 10-23. Domain pattern in yttrium iron garnet film viewed in transmitted 
polarized light (200 X ). 

transmitted polarized light to rotate, depending on the direction of M in 
individual domains. 

What makes these materials remarkable is that for certain applied (strip-out) 
fields, the unfavored stripe domains will shrink into stable right-cylindrical 
domains called bubbles . An important property of the bubbles is their ability 
to be moved laterally through the film. This is the basis for their use in 
commercial bubble devices for the computer memory and data recording 
markets. In these applications a thin-film array of conductors and Permalloy 
films patterned in various shapes (chevrons, I and T bars, etc.) are deposited 
on top of the bubble film. Bubbles can then be generated, moved, switched, 
counted, and annihilated in a very animated way in response to the driving 
fields. The stability, size and speed of bubbles are the key design parameters 
affecting device reliability, memory capacity, and data rate, respectively. 

1. Stability. Isolated bubble domain stability occurs when the ratio Ek/Em 
(or, equivalently, K u /2kM is greater than unity. A high value of K u , or 
in-plane anisotropy, encourages the 90° rotation into the perpendicular orienta¬ 
tion. Unless the ratio is sufficiently large, in-plane drive fields can strip out 
bubbles into stripe domains. 
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2. Size. The bubble diameter is predicted to be about 8 A X K U jrM] in 
size. It is left to the reader to show that \]A X K U is proportional to the domain 
wall energy; a small value of the latter fosters small bubbles. A large value of 
E m (or A/ 2 ) also favors small bubbles by reducing the surface density of free 
poles through domain formation. 

3 . Speed . Bubble speed is determined by the product of the drive field 
minus the threshold field for movement (coercive field) , and the bubble 
mobility y. m (vel ocity per drive field gradient). The latter is proportional to 
(1/a) \/A x /K u , where a is the Gilbert damping or magnetic viscosity 
parameter. 

Selection of optimum properties clearly involves trade-offs. Garnets possess 
the best combination of properties and have been most widely employed for 
magnetic bubble devices. Specifications currently call for: bubble size = 
0.5-1.0 fi m, 4 t rM s = 500-1000 gauss, H k - 1000 Oe, coercive field - 0, 
and fi m > 300 cm/sec-Oe. A considerable number and range of possible 
chemical substitutions are available to modify the basic garnet composition— 
{Y 3 + } 3 [Fe 3+ ] 2 (Fe 3 + ) 3 Or 2 2 . For example {Y, La to Lu, Bi, Ca, Pb}, [Fe, 
Mn, Sc, Ga, Al], and (Fe, B, Ga, Al, Ge) ions are used to facilitate growth of 
solid solution films with required device properties. Epitaxial garnet films are 
usually grown by liquid phase methods employing single-crystal Gd 3 Ga 5 0 12 
substrates. Since bubble motion is adversely affected by film defects elimina¬ 
tion of the latter is essential. 


10.8.5. Memory Device Configurations (Ref. 27, 38) 

The chapter closes by briefly conveying some notion of the two basic thin-film 
magnetic memory schemes that have been devised. In Fig. 10-24 a portion of 
a memory system based on in-plane magnetization film elements is shown. The 
latter are small isolated rectangles crisscrossed by three sets of conducting 
stripes. Word (W) drive lines run parallel to the easy axis which points either 
to the left (0) or to the right (1) when there is no applied magnetic field. A 
pulse field is then applied in the W line in the hard direction; M rotates 
toward this direction, and either a positive or negative signal is induced in the 
sense (5) line. If the W line signal exceeds H Ky then M rotates fully into the 
hard direction. When the W signal is reduced, the direction of M wavers. To 
avoid this instability, we apply a bit (B) field parallel to the easy axis to store 
the desired 0 or 1 state. In order to write, we must have the W pulse field large 
enough to drive all the bits into the hard direction. Similarly, the bit 
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Figure 10-24. A memoir plan with word lines , W 2 , and W 2 , bit lines B x , B 2y 
and B 2 and sense lines S, , S 2 , and S 3 . (From Ref. 27 with permission from 
McGraw-Hill, Inc.). 
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Figure 10-25. Schematic arrangement of a complete bubble memory device. (From 
Ref. 38). 




pulses must be large enough to ensure complete rotation either to the right or 
left without disturbing bits on other W lines. 

The schematic arrangement of a complete magnetic bubble memory device 
is shown in Fig. 10-25. Bubble creation occurs in the replicating generator 
where the magnetic field from a current pulse cuts a seed bubble in two. The 
seed bubble remains under a large Permalloy film patch for further bubble 
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generation while the freshly nucleated bubble enters the input track. There the 
bubble moves within constraining Permalloy film elements under the influence 
of current-induced, rotating magnetic fields. The swap gate enables bubbles to 
be transferred out of a storage loop and another bubble (or no bubble) to be 
simultaneously transferred to replace it. (Only one of a block of storage loops 
is shown.) Replication is similar to replicate generation; the difference is that 
bubbles in the storage loop, rather than form a seed bubble, are replicated. 
Finally bubbles are detected by the Permalloy magnetoresistance detector. 
After coming off the output track the chevron stretcher expands the bubble into 
a long stripe to maximize the detector signal. As the stripe passes under the 
interconnected column of chevrons (the detector), it changes the resistance of 
the Permalloy and gives rise to the output signal. 

Bubble memory devices with capacities in excess of a megabit are commer¬ 
cially available and offer advantages relative to disks and tapes. These include 
high storage capacities (~ 10 9 bits/cm 2 ) with 0.5-pm bubbles, absence of 
mechanical wear, nonvolatile memory, and a wide-temperatu re-range 
read-write memory. 



Exercises 



1. During four-point probe resistance measurements it is desired to limit 
currents to less than 0.050 A to prevent overheating the probe tips. The 
typical digital voltmeter available for measurement of the potential drop 
has a range of 10 mV to 100 V. Which of the following 5000-A-thick film 
materials have sheet resistance that are readily measurable by this method. 

a. Cu; p = 1.73 X 10“ 6 Q-cm? d. CoSi 2 ; p = 15 X 10” 6 Q-cm? 

b. Si; p = 2 Q-cm? e. TiN; p = 100 x 10“ 6 Q-cm? 

c. Zr0 2 ; p = 10 14 Q-cm? 

2 . A thin-film window de-icer resistor meanders over a length of 5 m and is 
1 mm wide. It is designed to deliver a total power of 5 W, employing a 
12-V power source. For a 5000-A-thick film, what sheet resistance is 
required? 

3 - Derive an expression for the ratio of the electrical resistance of a metal 
film stripe of length 21 evaporated from a surface source a distance h 
away to that of a uniformly thick-film stripe having the same number of 
atoms. Assume the film conductor is piecewise straight. 
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4 - The resistivity of an evaporated metal thin film measured at low tempera¬ 
ture is higher than the bulk resistivity value extrapolated to the same 
temperature. Reasons postulated for this behavior are (1) the size effect 
due to surface scattering, (2) a high concentration of quenched-in vacan¬ 
cies, and (3) generation of dislocation defects during cooling. Design a 
course of experimentation to resolve the cause. 

5 . An insulating film is characterized by $ B = 1 eV. How much does the 
Schottky emission current change at 100 °C for 

a. a 1 % change in temperature? 

b. a 1% change in electric field? 

c. a 1% change in film contact area? 

6- Conductor stripes deposited over insulators create resistor-capacitor 
structures whose RC time constants serve to limit the speed of circuit 
response. 

a. Derive a general expression for RC in terms of the length, width, 
thickness, and resistivity of the conductor, and the corresponding area, 
thickness, and dielectric constant (e,) of the insulator. 

b. For 1-pm-wide, 1-pm-thick, 1-cm-long A1 stripes (p = 2.7 X 10 -6 

Q-cm) across a 0.5-pm-thick Si0 2 film (e, = 3.8), what is RC 

(sec/cm)? [Note: The permittivity of free space is 8.85 X 10“ 14 
F/cm.] 

c. What is RC (sec/cm) if TaSi 2 (p = 55 X 10” 6 Q-cm) and poly-Si 

(p = 1000 X 10 ~ 6 fi-cm) are substituted for Al? 

7 - Two metal contact pads 1 cm apart on a bare NaCl substrate are used to 
monitor electrical resistance changes in a metal film that is deposited 
between them. Schematically plot the expected resistance-time behavior 
from the start of evaporation until the film is completely continuous. 
What are the operative conduction mechanisms during the various stages 
of film formation? Contrast the resistance response for high and low 
substrate temperatures. Also contrast the resistance response for island 
and layer growth mechanisms. 

8 - Using the approach of Thomson, calculate the mean-free electron path 
(Xy) in a thin metal film if specular scattering occurs at the air interface 
while the scattering at the substrate interface is diffuse. 

9- Consider an M/Si0 2 /N-Si capacitor structure (assume no oxide or 
interfacial charge). 
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a. Sketch band diagram representations for accumulation, depletion, and 
inversion regimes. 

b. Sketch the C- V characteristic. 

c. Sketch a series of C-V characteristics as a function of Si0 2 film 
thickness. 

d. Qualitatively indicate how the C-V characteristics change as a func¬ 
tion of Si doping level. 

. _ o 

10- A bulk plate of Si0 2 and a 3000-A film of Si0 2 are electrically contacted 
on both surfaces by Al. Assume the bulk resistivity of Si0 2 is 10' 5 G-cm 
and that an electric field of 10 5 V/cm is applied in both cases at 300 K. 

a. What current density flows through the bulk plate? 

b. What current density flows in the film if Schottky emission controls 
conduction? 

. , o 

11 - A 1000-A Si0 2 film underwent dielectric breakdown when an electric 
field of 1.2 X 10 7 V/cm was applied. It was observed that a l-/im 2 
damage channel through the Si0 2 shorted the Au electrodes and that 
failure occurred in 10 4 sec. If current flow is controlled by Schottky 
emission, estimate the maximum temperature rise in the film if its heat 
capacity is 0.24 cal/g °C. 

12 . a. Derive a general expression for the specific contact resistance if 
conduction occurs via tunneling. 

b. Do the same for Poole-Frenkel conduction. 

c. Estimate a value for R c given the J-i characteristics shown in Fig. 
10-10a. 

13- A sample of a 1-^m-thick (magnetic)film of Y 3 Fe 3 0 12 on a nonmagnetic 
Gd 3 Ga 5 0 12 substrate disk (2-cm diameter, 0.75-mm thick) fell from a 
microscope stage. Attempts to lift the film-substrate with a common bar 
magnet proved unsuccessful. Why? When suspended from a slender 
thread, however, the film-substrate is attracted to the magnet. Why? 

14 - a At zero magnetic field the total energy ( E T ) of a magnetic film is 

increased when the number of domains is increased. Why? At the 
same time E M decreases. Why? Schematically sketch these contribu¬ 
tions to E t and graphically indicate the optimum number of domains, 
b. The film in part (a) is magnetized, and cylindrical bubble domains 
interact with the applied field. Assume the only contributions to E T 
are domain wall energy ( E w ) t and Making appropriate assump- 



References 


505 


tions, sketch the dependence on bubble radius of the individual energy 
contributions and graphically indicate the equilibrium domain radius. 

15 . a. Use Fig 10-17 to construct a plot of the magnetization as a function of 

Film thickness at T / T c = 0.3. 

b. The difference in velocity between the outermost Mossbauer lines is 
proportional to the magnetization of Fe in Fig. 10-18. By measuring 
this velocity span, plot the variation of Film magnetization as a 
function of Film thickness. Compare results of (a) and (b). 

16. For the single-domain Film show that 

+ H 2 y » = 

This equation describes an astroid. To see what it looks like, make a plot 
in H x , H y coordinates. 
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Optical Properties 
of Thin Films 


11 . 1 . Introduction 


Thin films were first exploited practically for their optical properties. In the 
latter part of the nineteenth and first half of the twentieth centuries, the 
reflecting properties of metal films were utilized in assorted components of 
precision optical equipment. A noteworthy example was the Fabry-Perot 
interferometer developed in 1899, which required mirrors of very high re¬ 
flectance or “finesse.” This instrument enabled impressive accuracy to be 
attained in spectroscopy, thereby greatly advancing research in this field. The 
utility of dielectric films in optical applications was interestingly recognized as 
a result of observations by early spectroscopists and microscopists, notably 
Lord Rayleigh and Fraunhofer. They noticed that atmospheric corrosion of the 
lens surfaces of their instruments actually resulted in an enhanced overall 
transmission rather than a deterioration of performance. Interference effects of 
a surface layer was quickly discovered as the cause and it was not long before 
this damaging effect was capitalized upon in the form of antireflection (AR) 
coatings. These were first produced commercially by chemical etching, a 
process which persisted until the 1950s. In the mid-1930s, however, AR 
coatings were first produced by vacuum evaporation techniques and eventually 
proved to be more versatile and reliable than those made by etching. Coated 
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lenses then found rapid application in optical imaging equipment such as 
cameras, telescopes, binoculars, and microscopes. Similar coatings were sub¬ 
sequently employed in dielectric mirrors, optical filters, and selective ab¬ 
sorbers (Ref. 1). 

Current interest in optical films still centers around traditional optical 
components. But, in addition, a variety of new applications has emerged—end 
mirrors for lasers, antireflection coatings for solar cells, and films for energy 
conservation systems, to mention a few. The portion of the electromagnetic 
spectrum involved in virtually all optical film applications falls within the span 
from the ultraviolet (UV) to the infrared (IR) with particular emphasis focused 
in the visible region. This chapter will therefore be limited to the effects in this 
broad spectral domain. A common optical theory governs the phenomena 
exhibited by the many thin-film coating applications irrespective of operating 
wavelength range. Specific metal, dielectric, and semiconductor films have 
been deposited, frequently in layered combinations, to produce the necessary 
optical characteristics. For these reasons the broad topical outline of the 
chapter includes: 

11.2. Properties of Optical Film Materials 

11.3. Thin-Film Optics 

11.4. Multilayer Optical Film Applications 

Additional discussion of thin films employed in optoelectronic devices and 
optical communications (Chapter 7), integrated optics (Chapter 14), and optical 
recording (Chapter 14) can be found in the indicated chapters. 

Ample measure of the overall importance of optical film properties and 
components is found in the relatively large number of books devoted to the 
subject (Ref. 2-6). With regard to these the excellent treatment by Anders is 
recommended for its lucid presentation of the analysis, design, and production 
of coatings. Similarly the more recent and accessible book by Pulker is 
recommended for its wealth of useful data on optical coatings, and how they 
are processed and used. Readers seeking a more specialized treatment of 
particular topics should consult the widely acclaimed series Physics of Thin 
Films . Of the 67 review articles published in 13 volumes until 1987, fully 19 
deal with optical properties and applications of metal and dielectric thin films. 


11.2. Properties of Optical Film Materials 


11.2.1. General Considerations 

In order to understand the optical behavior of films and film systems, one must 
become familiar with the optical constants of materials, their origins, magni- 
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tudes, and how they depend on the way films are processed. The purpose of 
Section 11.2 is to provide a brief survey of these issues. The unifying concept 
that embraces all optical properties is the interaction of electromagnetic 
radiation with the electrons of the material. On this basis, optical properties are 
interpretable from what we know of the electronic structure and how it is 
affected by atomic structure, bonding, impurities, and defects. Quantum 
mechanics is required for a detailed description, but this is well beyond the 
scope intended here. Discussions will stress meanings and implications rather 
than mathematical rigor. 

Electromagnetic radiation propagates differently in materials than in free 
space because of the presence of charge. As a result, there is a change in the 
wave velocity and intensity of the radiation described by the complex index of 
refraction 

N ~ n — ik . (11-1) 

The quantity n is the real index of refraction, and k is the index of absorption, 
which is also known as the extinction coefficient. These are the two material 
optical constants that we are concerned with here. The spatially dependent 
portion of the electric field of a wave propagating in the x direction is then 
expressed by 

( HirNx \ 2irkx ilirnx 

- = ^ 0 exp--—exp--—, (11-2) 

A / A A 

where £ 0 is the field amplitude and X is the wavelength. In free space the 
index of refraction is unity and the wave velocity is c, the speed of light; in the 
medium these respective quantities are n and c/n. The real function exp — 
2irkx /\ represents an exponential damping or attenuation of the wave due to 
some absorption process within the material. On the other hand, the imaginary 
exponential portion of Eq. 11-2 contains n and reflects propagation without 
absorption. All materials exhibit varying proportions of these two linked 
attributes of N. In the highly absorbing metals, for example, n is usually small 
compared with k. On the other hand, the dielectric films used for optical 
purposes are highly nonabsorbing and k is vanishingly small compared with 
n . Multiplying £ by its complex conjugate (i.e., / = V— 1 replaced by -/) 
leads to the well-known expression for the attenuation of the intensity, I oc 
or /oc <f 0 2 exp - 4 t rkx/\. Therefore, 

I = / 0 exp - ax , (11-3) 

where the absorption constant a is defined as 4 tt/:/X, and / 0 is the intensity 
of incident radiation. A common unit for I 0 is W/cm 2 . An important applica¬ 
tion of this formula occurs in optical communications systems. A measure of 
the quality of the waveguide frequently quoted is attenuation in decibels (dB); 
the number of dB = 101og 10 / 0 //. By comparison with Eq. 11-3, 1 dB/cm is 
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equal to 4.34a. Extremely low absorption losses of ~ 1 dB/km are common 
in silica-based optical fibers. In contrast, a 1-dB loss occurs within a few 
angstroms of penetration beneath the surface of a metal. This corresponds to a 
difference of some 12 orders of magnitude in absorption behavior on the part 
of materials. 

Of the total radiation energy incident on an object, a fraction R is reflected 
from the top surface and a fraction T is transmitted through the bottom 
surface. The remaining fraction is lost through electronic absorption (A) 
processes and by scattering ( S ) at surface and volume imperfections. Surface 
roughness, internal boundaries, and density fluctuations arising from porosity, 
pinholes, microcracks, particulate incorporation, and impurities are sources of 
scattering. Adding the various contributions gives 

/?+ T+A +S= 1. (11-4) 


Of the terms, R is of the greatest interest to us. For light passing through a 
medium of refractive index /z 0 , impinging normally on a transparent film of 
index n { , 


«1 - ”o \ 2 
n, + n 0 l 


(11-5) 


If, however, the film is absorbing with index of absorption k x 

R = K~”i ) 2 + *? 

(«o + rt,) 2 + k\ 

The origins of both formulas will be indicated subsequently. 


( 11 - 6 ) 


11.2.2. Optical Properties of Metals and Mirrors 

The large density of empty, closely spaced electron energy states above the 
Fermi level in a metal plays an important role in influencing its optical 
properties. Incident photons over a wide wavelength range are readily ab¬ 
sorbed by conduction-band electrons. These excited electrons move to higher 
energy levels where they undergo collisions with lattice ions, and the extra 
energy is dissipated in the form of phonons. The lattice is thus heated, and we 
speak of absorption. Alternatively, if the probability of colliding with an ion is 
small, the electron will emit a photon as it drops back to a lower energy level. 
This results in the strongly reflected beam exhibited by metals in the visible 
and infrared region. The characteristic color of some metals is due to the 
preferential absorption of some portion of the visible spectrum. In gold, for 
example, the green portion is absorbed, and the metal assumes the coloration 



11.2 Properties of Optical Film Materials 


511 


Table 11-1. Optical Constants of Metal Films Employed for Mirrors 


Wavelengths, n 

Silver 

Gold 

Copper 

Aluminum 

n 

k 

n 

k 

n 

* 

n 

k 

0.40 

0.075 

1.93 

1.45 


0.85 


0.40 

3.92 

0.45 

0.055 

2.42 

1.40 

1.88 

0.87 

2.20 

0.49 

4.32 

0.50 

0.050 

2.87 

0.84 

1.84 

0.88 

2.42 

0.62 

4.80 

0.55 

0.055 

3.32 

0.34 

2.37 

0.72 

2.42 

0.76 

5.32 

0.60 

0.060 

3.75 

0.23 

2.97 

0.17 

3.07 

0.97 

6.00 

0.65 

0.070 

4.20 

0.19 

3.50 

0.13 

3.65 

1.24 

6.60 

0.70 

0.075 

4.62 

0.17 

3.97 

0.12 

4.17 

1.55 

7.00 

0.75 

0.080 

5.05 

0.16 

4.42 

0.12 

4.62 

1.80 

7.12 

0.80 

0.090 

5.45 

0.16 

4.84 

0.12 

5.07 

1.99 

7.05 

0.85 

0.100 

5.85 

0.17 

5.30 

0.12 

5.47 

2.08 

7.15 

0.90 

0.105 

6.22 

0.18 

5.72 

0.13 

5.86 

1.96 

7.70 

0.95 

0.110 

6.56 

0.19 

6.10 

0.13 

6.22 

1.75 

8.50 

2.0 

0.48 

14.4 

0.54 

11.2 



2.30 

16.5 

3.0 





1.22 

17.1 



4.0 

1.89 

28.7 

1.49 

22.2 



5.97 

30.3 

6.0 

4.15 

42.6 

3.01 

33.0 



11.0 

42.4 

7.0 





5.25 

40.7 



8.0 

7.14 

56.1 

5.05 

43.5 



17.0 

55.0 

10.0 

10.69 

69.0 

7.41 

53.4 



25.4 

67.3 


From Ref. 7. 



Figure 11-1. Reflectance of Al, Ag, Au, Cu, Rh, and Pt mirror coatings from the 
UV to the IR. (From Ref. 8). 


of the reflected red and yellow light. Silver and aluminum reflect all portions 
of the visible spectrum and therefore appear to have a white color. 

Optical constants for the most important mirror-coating metals, Ag, Au, Cu, 
and Al, are given in Table 11-1. The corresponding reflectivities are plotted as 
a function of wavelength in Fig. 11-1. A noteworthy feature of the optical 
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response is the decreased reflectivity in the ultraviolet and, in particular, the 
abrupt absorption edge exhibited by the noble metals. The absorption is due to 
interband electron transitions (e.g., 3d -> 4s in Cu). In aluminum, however, 
and rhodium, to a lesser extent, high reflectance extends into the ultraviolet. 
In the infrared all metals are very highly reflective. This fact is explained by 
the Hagen-Rubens relation, which holds when the electron relaxation time is 
short compared with the period of the incident wave. Physically, electrons can 
then respond sufficiently rapidly to prevent the external electric field from 
penetrating the metal. 

The optical properties of thin films are somewhat different from those of 

o 

bulk metals. In uitrathin films (< 100 A thick) variations in film continuity 
make the concept of optical constants problematical. In thicker continuous 
films, n and k values tend to be slightly smaller and larger, respectively, than 
comparable bulk values. The differences stem largely from variables encoun¬ 
tered in evaporation or sputtering, the almost universally employed processes 
for depositing metal film mirrors. In metals such as Al, which readily getter 
gases, the reflectivity is sensitive to deposition conditions. Lower operating 
pressures and higher deposition rates result in purer films with enhanced 
reflectivities as shown in Fig. 11-2. The reason is due in part to the absence of 
surface oxides, nitrides, etc., which serve to increase absorption. More noble 
metals like Rh and Pt are, on the other hand, relatively immune to deposition 
conditions. Other variables affecting reflectivity include incident angle of 



DEPOSITION RATE (A/SEC) 


Figure 11-2. Reflectance of Al as a function of deposition rate in high (3 x 10~ 6 
torr) and ultrahigh (5 x 10 -9 torr) vacuum. Films were aged 24 h in air and measured 
at a wavelength of 2000 A. (From Ref. 7). 
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vapor flux, surface topography, and substrate temperature (Refs. 7, 8). In 
metals such as Al, Ag, and Au, high substrate temperatures result in coarser- 
grained films with rough surfaces that tend to diffusely scatter light. A contrary 
behavior is exhibited by Rh whose reflectivity is enhanced by some 2-6% over 
the spectral range from 0.4-2.2 /xm, when deposited at 400°C rather than 
40°C. 

The reader should appreciate that even small reflectance enhancements are 
very significant in mirror applications. Therefore, much care is exercised 
during deposition, and steps are frequently taken subsequently to prevent 
degradation of reflectance due to oxidation (Al), sulfide tarnishing (Ag), and 
mechanical scratching. Mirrors are therefore frequently coated with hard 
transparent protective overlayers. Thus films of SiO, Si0 2 , and A1 2 0 3 are 
commonly used to protect evaporated Al mirrors, but usually at the cost of 
increasing absorbance. The superior reflectance of Ag is offset by its poor 
adhesion and susceptibility to atmospheric attack. Various substrate film glue 
layers (e.g., A1 2 0 3 , nichrome) and coatings (e.g., A 2 0 3 , SiO) have been 
used to improve adhesion, but Ag film use in mirrors remains restricted. Films 
of Rh are ideally suited as front surface mirrors because they are hard and 
chemically very durable giving them long-term stability. Adhesion to fused 
silica based substrates is often poor, however. Despite its relatively low 
reflectivity, evaporated Rh has found application in telescope mirrors, optical 
reflectivity standards and in mirrors for medical applications. 

11.2.3. Optical Effects in Dielectrics 

Dielectric materials employed in optical coating applications include fluorides 
(e.g., MgF 2 , CeF 3 ) oxides (e.g., A1 2 0 3 , Ti0 2 , Si0 2 ) sulfides (e.g., ZnS, 
CdS) and assorted compounds (e.g., ZnSe, ZnTe). Bonding characteristics 
ranging from ionic to covalent are represented in these materials. An essential 
common feature of dielectric optical materials in their very low absorption 
(a < 10 3 /cm) in some relevant portion of the spectrum; in this region they are 
essentially transparent (e.g., fluorides and oxides in the visible and infrared, 
chalcogenides in the infrared). 

The refractive index n is basically the only optical constant of interest 
insofar as optical coating design is concerned. In different materials the 
refracted beam travels at different velocities because incident and forward- 
scattered beams interact and produce a phase shift. This phase shift can be 
interpreted in terms of a difference in velocity between incident and refracted 
beams. As might be expected the magnitude of n depends on the strength of 
the refracted beam, which in turn depends on the density of electrons. In polar 
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materials such as oxides, glasses, and compound semiconductors the lattice is 
essentially a collection of permanent electric dipole moments ( p e ). The 
magnitude of p e is equal to the product of the effective charge (nuclear, 
electronic) and the distance between charge centers. Wave retardation occurs 
because of the interaction between the electromagnetic radiation ($ field) and 
the stretched permanent dipoles. In nonpolar solids like diamond and Si, the 
incident radiation creates induced electric dipole moments by displacing 
atomic electrons relative to the nuclear charge. In this case, p e = a p S\ where 
a p is known as the polarizability. It is beyond our scope to derive the link 
between the microscopic polarization and the macroscopic index of refraction 
known as the Lorenz-Lorentz equation, namely 

3 ( n 2 - 1) M 

p (« 2 + 2) p 

where N A is Avogadro’s number, M is the molecular weight, and p is the 
density (Ref. 9). Thus, high refractive indices are associated with large ionic 
polarizabilities, which increase with the size of the ion and with the degree of 
negative charge on isoelectric ions. In glasses and cubic crystals, n is 
independent of crystallographic directions. In other crystal systems, n is large 
in close-packed directions. Similarly more open structures of high-temperature 
polymorphic phases have lower refractive indices than crystalline forms of the 
same composition. In Si0 2 , for example, in order of increasing density 
^giass — 1 '46, ^tridymite 1.47, ^ cr istobaiite — 1*49, and — 1.55. Com¬ 

pounds with predominant covalent bonding have higher refractive indices than 
ionically bound solids. For example, in order of increasing degree of covalent 
bonding n ZnC i 2 = 1.68, n Zn0 = 2.08, n ZnS = 2.37, n ZnSe = 2.57, and n ZnTe 
= 3.56. 

The excellent transmission of dielectric materials in the visible region of the 
spectrum is terminated at short wavelengths with the onset of the ultraviolet 
absorption edge. The critical radiation wavelength X c at which this occurs is 
given by the familiar equation 

X c = he/E g or \ (pm) = l.2A/E g (eV). (11-8) 

Values physically correspond to electronic transitions from the filled valence- 
band levels across the energy gap E g to the unfilled conduction-band states. 
Multiple peaks near the UV absorption edge indicate the complexity of these 
processes. At long wavelength the high optical transmission is once again 
limited, this time by absorption due to the vibration of lattice ions in resonance 
with the incident radiation. The frequency of maximum absorption is related to 
the force constant and masses of vibrating anions and cations. 
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11.2.4. Optical Effects in Semiconductors 


Lastly, some elementary aspects of the optical properties of semiconductors 
will be briefly considered. The absorption coefficient behavior of a number of 
important semiconductor materials was previously noted in Fig. 7-13. The 
most prominent feature is the rapid decrease in absorption at the critical or 
cutoff wavelength X c . Here as in dielectrics the magnitude of the energy gap 
between the valence and conduction band governs the value of X c . For 
wavelengths larger than X c , semiconductors are essentially transparent because 
no mechanisms exist to excite electron transitions. However, for wavelengths 
shorter than X c , electrons can be stimulated into the conduction band. In 
addition, the generated free carriers can now absorb quanta of energy and 
occupy excited levels in the continuum of conduction-band states. Semiconduc¬ 
tors then behave like metals and are highly reflective. 

An interesting phenomenon occurs in this regime of free-carrier light 
absorption. This effect has relevance to semiconducting In 2 0 3 :Sn films that 
are widely used because they have the unusual property of being transparent 
conductors (see Section 11.2.5). In semiconductors (and metals), values of n 
and k are generally not independent of each other but are connected by 
electromagnetic theory through so-called dispersion relations (Ref. 10) 


n 2 - k 2 = e x - n c q 2 /m*e 0 (o) 2 + 7 2 ), 


Ink 


vW_ 

m*e 0 w(o> 2 + 7 2 ) 


(H-9) 

( 11 - 10 ) 


In these formulas, and £ 0 are the permitivities of the solid and free space, 
respectively, n c is the carrier concentration in the conduction band, m* is the 
effective mass of the charge carrier, q is the electronic charge, oj is the 
angular frequency of the incident radiation of wavelength X(a> = 2 ttc/X), and 
7 is the reciprocal of the carrier relaxation time. Basically, 7 is inversely 
related to the electrical conductivity. When n = k, the optical properties 
change radically. The critical value of oj for which this happens is called the 
plasma frequency o ) p , and corresponding to it is the plasma wavelength \ p . At 
the optical transition the conductivity is high, and 7 2 may be neglected. It is 
then easy to show that 


K 


lire 

Q 


£ 0 £ l /»* \' /2 

n c I 


( 11 - 11 ) 


which only depends on the carrier concentration. When X> X p , the film 
exhibits metal-like reflection, whereas if X < \ p the high transmittance of a 
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Figure 11*3. Reflectance versus surface resistivity (sheet resistance) for In 2 0 3 :Sn 
layers. Variations in film thickness and carrier concentration are responsible for the 
wide range of p/d values. (From Ref. 11). 

dielectric occurs. This means that In 2 0 3 :Sn films can be prepared to possess 
the desired, though not independent, admixture of optical reflectivity and 
electrical resistivity as shown in Fig. 11-3. 

In photonic devices the operating wavelength is usually either \ c (light-emit¬ 
ting diodes, lasers) or less than \ c (photodetectors, solar cells). It is essential 
in the latter applications that light be absorbed for the devices to operate. The 
opposite is required of semiconductors in optical coating applications where 
transparency is essential. Another difference relates to structure. In photonic 
devices either bulk or thin-film single-crystal semiconductors are essential for 
operation. On the other hand, polycrystalline or amorphous Ge and Si are 
deposited in optical coatings. The extension of the spectral regime of trans¬ 
parency down to \ c = 0.82 jim for amorphous relative to polycrystalline Si, is 
an advantage in optical coatings; in solar cell use however, the constricted 
range of absorption is a disadvantage, other things being equal. Films possess¬ 
ing tailor-made energy band gaps and refractive indices, prepared from 
compound semiconductor mixtures, have been discussed previously (Chap¬ 
ter 7). 

11.2.5. Transparent Conducting Films (Ref. 12) 

Somewhere between metals and dielectrics is an interesting class of materials 
known as transparent conductors. According to electromagnetic theory, high 
conductivity and optical transparency are mutually exclusive properties since 




11.2 Properties of Optical Film Materials 


517 


photons are strongly absorbed by the high density of charge carriers. Although 
there are materials that separately are far more conductive or transparent, the 
transparent conductors dealt with here exhibit a useful compromise of both 
desirable properties. Broadly speaking, transparent conducting films consist 
either of very thin metals or semiconducting oxides. The first widespread use 
of such films was for transparent electrical heaters in aircraft windshield 
de-icing applications during World War II. Today they are used for automobile 
and airplane window defrosters, liquid crystal and gas-discharge displays, front 
electrodes for solar cells, antistatic coatings, heating stages for optical micro¬ 
scopes, IR reflectors, photoconductors in television camera vidicons, and 
Pockel cells for laser Q-switches. 

Metals that have been used as transparent conductors include Au, Pt, Rh, 
Ag, Cu, Fe, and Ni. Simultaneous optimization of conductivity and trans¬ 
parency presents a considerable challenge in film deposition. At one extreme 
are discontinuous islands of considerable transparency but high resistivity; at 
the other are films that coalesce early and are continuous, possessing high 
conductivity but low transparency. For these reasons, the semiconducting 
oxides Sn0 2 , ln 2 0 3 , CdO, and, more commonly, their alloys (e.g., indium-tin 
oxide, also known as ITO) doped ln 2 0 3 (with Sn, Sb), and doped Sn0 2 (with 
F, Cl, etc.) are used. 

Deposition processes have included chemical and physical methods. Hydrol¬ 
ysis of chlorides and pyrolysis of metalorganic compounds are examples of the 
former; reactive evaporation and sputtering in an oxygen environment are 
examples of the latter. Optimum film properties require maintenance of tight 
stoichiometry. Since glass substrates, which are often heated close to the 
softening temperature, are commonly employed, care must be taken to prevent 
stresses and warpage. Generally films of comparable quality can be produced 
by chemical or physical deposition processes. 

The inverse relation between film reflectivity and sheet resistance should be 
noted in Fig. 11-3. 

11.2.6. Optical Properties of Thin Dielectric and Semiconductor Films 

The optical properties of dielectric and semiconductor films are collected in 
Table 11-2. Practically all of the data are for evaporated films, but some 
results for sputtered films are included. Only a portion of the exhaustive 
information on these films, provided by Pulker (Ref. 5) in his excellent 
treatment of this subject, is considered here. Among the important issues 
concerning optical films are the magnitudes of n and k f the spectral range of 
transmission, the dependence of properties on film structure and deposition 
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processes, and environmental effects. These issues will now be addressed in 
turn. 


77-2.6-7- Magnitude of n and k — Spectral Range . For the most part, n 
in fluoride and oxide films has a value less than 2 at the reference wavelength 
of 0.55 /xm. For many applications, however, it is important to have films with 
higher refractive index in the visible range. To meet these needs, materials like 
ZnS and ZnSe are employed. High transmittance is an essential requirement in 
optical films, and as an arbitrary criterion only materials with an absorption 
constant less than a = 10 3 /cm are entered in Table 11-2. In practice, how¬ 
ever, only films with significantly lower absorption can be tolerated. For 


Table 11 -2. Optical Properties of Films 


Film 

Composition 

Structure 

(packing density P) 

Transmittance 
Range (pm) 

Index of 

Refraction 

NaF 

C 

0.2- 

1.29-1.30 

LiF 

C 

0.11-7 

1.3 

CaF 2 

C; P = 0.57-1 

0.15-12 

1.23-1.46 

Na 3 AlF 6 

C; P = 0.88 

0.2-14 

1.32-1.35 

A1F 3 

A; P = 0.64 

0.2- 

1.23 

MgF 2 

C; P = 0.72 

0.11-4 

1.32-1.39 

ThF 4 

A 

0.2-15 

1.50 

LaF 3 

C; P = 0.80 

0.25-2 

1.55 

CeF 3 

C; P = 0.80 

0.3-5 

1.63 

Si0 2 

> 

*0 

11 

0 

0.2-9 

1.45-1.46 

A1 2 0 3 

A; P = 1 

0.2-7 

1.54 

MgO 

C 

0.2-8 

1.7 

y 2 o 3 

A 

0.3-12 

1.89 

La 2 0 3 

A 

0.3- 

1.98 

Ce0 2 

C 

0.4-12 

2.2 

Zr0 2 

A; P = 0.67 

0.34-12 

1.97 

SiO 

A 

0.7-9 

2.0 

ZnO 

C 

0.4- 

2.1 

Ti0 2 

A 

0.4-3 

1.9 

ZnS 

C; P > 0.94 

0.4-14 

2.3 

CdS 

C 

0.55-7 

2.5 

ZnSe 

C 

0.55-15 

2.57 

PbTe 


3.5-20 

5.6 

Si 

A 

1-9 

3.4 

Ge 

A 

2-23 

4.4 


C — crystalline; A = amorphous. 
From Ref. 5. 
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Figure 11-4. Spectral region of high transparency of dielectric films. Indices of 
refraction are given at the indicated wavelengths. (From Ref. 8). 

example, in laser AR coatings losses must be kept to less than 0.01%, 
corresponding to k ~ 4 X 10“ 5 or a = 10/cm at X = 5500 A. The spectral 
regions of high transparency for the most commonly employed dielectric films 
are depicted in Fig. 11-4 together with corresponding n values. 


11.2-6.2. Film Structure and Its Effect on n. Generally n values in films 
are lower than those in bulk. For example, n - 2.35 for bulk cubic ZnS at 

o o 

6330 A, whereas n = 2.27 in a 1000-A-thick film. Thinner films show even 
lower values, and thicker film behavior approaches that of the bulk. Film 
structure apparently has an important influence on optical properties. The 
connection is through Eq. 11-7, which indicates that n decreases with decreas¬ 
ing film density, a fact commonly observed in porous films. 

Oxides like A1 2 0 3 , Si0 2 , Zr0 2 , and Ti0 2 have a tendency to be amor¬ 
phous when deposited as films. In common with metals and semiconductors, 
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Figure 11-5. SEM micrograph of columnar structure of a ZnS film layer deposited 
by diode ion plating. (Courtesy of H. A. Macleod). 


however, some dielectric films also exhibit a zone structure. As an example, 
the columnar structure of ZnS is shown in Fig. 11-5. Depending on the 
proportions of grains and voided boundaries present, differing film densities 
result. The packing density P (defined by Eq. 5-44) for optical films lies in the 
range 0.7-1.0. The simplest model of a columnar grain structure consists of an 
array of close-packed cylindrical columns of identical diameter, and it is a 
simple matter to show that P = 0.907 in this case. Other structural models are 
possible. For example, unevenly contracting columns will form truncated 
cones with P < 0.907; columns expanding into a hexagonal shape will be 
more densely packed than cylinders and P > 0.907. 

Various relationships have been proposed to relate refractive index to 
microstructure, or n or P (Ref. 13). The simplest is a linear law of mixing; 
i.e., 

n = n s P + n v (l - P), (11-12) 

where n s and n v are the refractive indices associated with the solid film and 
voids (or pores) respectively. 

11.2.6.3. Deposition Processes and Their Effect on n and k. In order 
to produce films with low absorption losses, we must carefully control various 
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parameters in the traditionally employed evaporation processes. First of all, the 
materials selected for deposition should be carefully selected as to purity. 
Substrates must be scrupulously cleaned. It is recommended that vacuum- 
sintered and outgassed materials, or even pieces of dense single crystals, be 
used to avoid gas outbursts and spitting from the melt during evaporation. 
Reactions of fluorides or oxides with resistively heated crucible materials lead 
to contamination and absorption in the UV. Electron-beam evaporation has 
overcome this difficulty, and virtually all of the oxides can be deposited as 
highly transparent, undecomposed films this way. To ensure a high and 
constant evaporation rate and avoid decomposition through overheating, a 
defocused electron beam (e.g., ~ 2 cm in diameter) is recommended. Another 
technique for producing highly transparent oxides, even when lower oxides or 
metals are used as starting materials, is reactive evaporation. In such a case, 
evaporant and oxygen impingement rates must be carefully controlled. 

The substrate temperature during deposition has an important effect on 
refractive index. In general, n increases as the substrate temperature is raised. 
In the range 40-300 °C, the difference is small in A1 2 0 3 because the films are 
amorphous at both substrate temperatures. In Ce0 2 , however, an increase in n 
occurs over the same temperature range (Ref. 8). These films were crystalline, 
and the grain size increased substantially with increasing substrate temperature. 
The rise in n apparently stemmed from pore elimination during sintering. 

Improvement in the preparation of optical thin films over the years can be 
appreciated in the case of ZnS. A steady decrease in k from 1.1 X 10“ 2 to 
2.0 x 10~ 3 and then to 9 X 10“ 5 was reported in 1952, 1956, and 1974, 
respectively, over the spectral range 5100 to 5500 A. No doubt, general 
property improvement and film reproducibility were the result of more careful 
attention to deposition process parameters. The relationship of the latter to film 
properties is conveniently summarized in Table 11-3. Strong, established, and 
possible dependencies are noted. The complexity of the interplay between film 
properties and process variables is evident, particularly when one parameter 
may alter several properties. 

The quest for improved and more reproducible coatings is unceasing as the 
needs of modem precision optics extends to such applications as high-power 
lasers and laser gyroscopes, where extreme stability and high reflectance are 
required. In recent years, the use of sputter deposition and ion-assisted 
technology in particular, has resulted in significant improvements in optical 
properties of films. Although still largely in the research and development 
stage, the results are promising enough to warrant some discussion here. The 
deposition processes employed include ion-assisted deposition (IAD), ion 
plating, conventional sputtering as well as the variants employing ion sources 



Table 11-3. Influence of Process Variables on Optical Properties of Thin Films 


Film Property 

Substrate 

Material 

Substrate 

Cleaning 

Starting 

Material 

Glow 

Discharge 

Evaporation 

Method 

Rate 

Pressure 

Substrate 

Vapor Temperature 

Refractive 



E 

E 


S 

S 

S 

S 

index 










Transmission 



E 

E 

E 

P 

P 


P 

Scattering 

S 

E 

P 

P 

E 

P 

S 

E 

S 

Geometric 


E 


E 


E 

E 

E 

S 

thickness 










Stress 

s 

P 

S 



E 

E 

E 

E 

Adherence 

E 

S 


S 

E 

E 

E 

E 

E 

Hardness 

E 

E 


E 


E 

S 

E 

S 

Temperature 

E 





E 

E 

E 

S 

stability 










Insolubility 

P 

E 

E 

E 


E 

S 

E 

S 

Resistance to 

P 

E 

E 

E 

E 

E 



E 

laser radiation 










Defects 

S 

E 

E 

E 

S 

E 

P 


E 


S = strong effect; E = established effect; P = possible effect. 
From Ref. 14. 
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WAVELENGTH (pm) 

Figure 11-6. Summary of the variation of n in Ti0 2 films deposited by ion-based 
and conventional methods. Solid line is the bulk Ti0 2 (rutile) data. IAD—O#©; 
sputtering— A V A ▼ f ; reactive evaporation, DRH3; dotted line represents data 
from conventionally deposited films. (From Ref 14 © 1986 Chapman and Hall). 

discussed in Chapter 3. As a result columnar growth is interrupted, the 
crystallite size decreases, the packing factor increases, and the film density 
approaches bulk values with net improvement in optical film properties. A 
good example is provided by amorphous Ti0 2 where the spread of the 
refractive index obtained by different investigators, employing various ion- 
based techniques, is presented in Fig. 11-6. Although very few results ap¬ 
proach the optical properties of bulk rutile, in virtually all cases at all 
wavelengths, conventionally deposited Ti0 2 has a lower refractive index than 
films prepared employing ion bombardment. Convincing evidence for the 
benefits of ion bombardment was obtained through real-time monitoring of 
light transmission through substrates during deposition. Switching the ion 
beam on and off caused corresponding variations in the transmission to occur 
in concert. They were interpreted in terms of changes in n and k. Experiments 
on Zr0 2 films also clearly revealed an increase in n and decrease in k as a 
result of ion bombardment (Ref. 14). 


11.2.6.4. Environmental Effects. Exposure of optical films to elevated 
temperatures and corrosive and humid environments has deleterious effects on 
their properties. Temperature variations produce reversible and irreversible 
property changes. Typically, dn/dT ~ 10 -5 to 10 -4 per degree Celsius, and 
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positive as well as negative refractive index changes are observed. Absorption 
coefficient variations in semiconducting films are also reversible over small 
temperature ranges, but for large temperature excursions irreversible changes 
occur. Films may density with an increase in packing and refractive index. 
Reactions between film layers may form compounds and even free metal, with 
attendant increases in absorption. Thermally induced stresses having harmful 
implications with respect to film adhesion and cracking are also a possibility. 
Resistance of optical films to etching and chemical attack of saltwater environ¬ 
ments, S0 2 and H 2 S gases, cleaning agents, acids, and sometimes even 
fungus, is highly desirable. 

The subject of water absorption by optical films, particularly those of low 
refractive index, has been of concern for many years. Water vapor penetrates 
through the microvoid network and gaps between columnar grains of low-pack¬ 
ing-density films by capillary action. The net effects are swelling, changes in 
refractive index, and overall shifts in the spectral transmittance of optical 
coatings. 


11 . 3 . Thin-Film Optics 


11.3.1. Nonabsorbing Films (One Interface) 

In this section we present the theory of the optical behavior of a single 
transparent (nonabsorbing) film on a nonabsorbing substrate. Transparent films 
are of great practical importance because applications of all kinds are based on 
one or, more commonly, multiple layers of such films. Crucial to understand¬ 
ing these applications is how radiation is reflected and transmitted at a single 
interface between two media of differing index of refraction. 

Let us consider Fig. 11-7, which depicts an interface between an optically 
isotropic medium with an index of refraction n x and a vacuum region of index 
n 0 (= 1). Electromagnetic radiation is incident on the n 0 /n { interface at an 
angle 4> Q and refracted in the medium at an angle <t> l with respect to the 
normal. The electric field vector of the incident wave, is decomposed into 
two components, and , respectively perpendicular and parallel, to the 
plane of incidence. Through the use of the Maxwell equations with appropriate 
interfacial boundary conditions, the amplitudes of the components of the 
transmitted , $ x ) and reflected (<f 0 x > ^o' 1 ) waves in the medium and 
vacuum can be determined. The results of the calculation are the celebrated 
Fresnel coefficients for reflection (r) and transmission (/) at the n 0 fn x 
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Figure 11-7. Light incident on an interface at an oblique angle undergoing both 
reflection and refraction. 


interface given by 


x V 

w 0 cos <j> 0 - n x cos <t> } 

'■ - c 

n 0 cos <j> 0 + n x cos <f> x 

1, # 0 

rt 0 cos 4> x - n x cos <£ 0 

ri gf 

n 0 cos <j> x -1- jCOs 0 O 

6\ x 

t x - 1 

2/* 0 cos <j> 0 

1 g i - 1 

n 0 cos <t> o + n x cos <j> x 

/pii 

/it - _L 

2 n 0 cos <£ 0 


n 0 cos <t >i + /z,cos <f > 0 


(11 -13a) 
(11-13b) 
(1 l-13c) 
(ll-13d) 


Extensions of these equations are the basis for the mathematical analysis of the 
optical behavior of single films and multilayer film structures. Changing the 
subscripts from 0,1 to 1,2 makes the results applicable to the film (n^-sub- 
strate ( n 2 ) interface in a single film layer. 

For normal incidence, where <f> 0 = <f> x = 0 the distinction between the 
perpendicular and parallel electric field components vanishes and the reflection 
and transmission coefficients are given by 


± II "‘0 

r = r,= r = 


n n - n, 


n, + n n 


t = tf= t\ = 


2 tin 


«i + n 0 


(11-14) 

(11-15) 


The equation for r contains information on the amplitude and phase of the 
reflected wave. For example, if /i 0 = 1 and n x = 1.52, r = (1 - 1.51)/(1 + 
1.52) = -0.206, which is equal to +0.206e ±/ir . This represents the well- 
known phase jump of tv during reflection at an optically denser medium. If, 
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however, the wave propagates in the reverse direction, at the n x / n 0 interface 
n 0 = 1.5 and n x = 1 (for consistency) and r - (1.52 - 1)/(1.52 + 1) = 
0.206. There is no phase jump in this case. For nonabsorbing media the ratio 
of the energy reflected to that incident is r 2 or R, the reflectance; its 
magnitude is given by Eq. 11-5. Similarly the ratio of the energy transmitted to 
that incident is (n l /n 0 ) 2 t 2 or T , the transmittance, given by 

T= 4n l n 0 /(n i + n 0 ) 2 . (11-16) 

Clearly, R + T = 1. 

11.3.2. Nonabsorbing Film (Two Interfaces) 

We now consider reflection of light at both film interfaces in Fig. 11-8 where 
radiation may be partially reflected or transmitted. The separation between the 
boundaries, tf, is of the order of magnitude of the wavelength X so that 
interference effects occur. It is assumed that the film is plane parallel and 
homogeneous. The optical properties of the film will now depend on n 0 , n x , 
n 2 , tf, and X. If a plane wave of unit amplitude is normally incident on the 
n 0 /n x interface at A, the fraction refle cted is r x = ( n 0 - n x )/(n x + w 0 ), and 
the wave transmitted has amplitude y/ 1 - r\ . At point B of the film-sub¬ 
strate (n x /n 2 ) interface, r 2 = {n x - n 2 )/(n 2 + n x ) so that a wave of ampli¬ 
tude r 2 y/l — r 2 is reflected back to point C on the first interfac e. Here the 
wave is reflected again into the film with amplitude - r x r 2 yj 1 - r 2 e l6 . 
Compared with the first reflection at A, the emerging wave at C has twice 
traversed the optical thickness, n x d , of the film. The total path difference, 
2 n x d, implies a phase 6 of amount b = (27r/X)(path difference), and there¬ 
fore b = 4 irn x d/\. The portion of the wave transmitted into free space at C 
is attenuated by the factor y/ 1 - r 2 so that its amplitude is given by 
r 2 (\ - r 2 ). Adding the two emergent waves in amplitude and phase yields 



Figure 11-8. Reflections at both surfaces of a nonabsorbing, homogeneous thin film 
at normal incidence. 
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r \ + r i (1 " r x )e~ l6 . Continuing the process and summing the multiply re¬ 
flected beams whose amplitudes decline with repeated interfacial impingement, 
the result is 

/■, + r 2 exp - id 

r = -. (11- 

1 4- r^exp - id 


By squaring r according to the rules of complex number algebra, the re¬ 
flectance R is expressed by 


r \ + t\ + 2r 1 r 2 cos 5 
1 + r\r\ + 2/-,r 2 cos d 

Similarly, the transmittance is given by 


« 0 1 + r^rl + lr x r 2 cos d 


(11-18) 


(11-19) 


where t x , t 2 are extensions of Eq. 11-15. 

The first of these equations is an extremely important one with many 
implications for the design and analysis of optical films. First, R is plotted as a 
function of phase or optical thickness in Fig. 11-9 for a number of different 
film refractive indices. As a reference, the refractive index of the uncoated, 



Figure 11-9. (Lower portion) Reflectance as a function of n x d for films of various 
n x values on a glass substrate of n 2 =1.5 (also see Fig. 6-2); (upper portion) 
reflectance at a nonabsorbing uniform film between two media with same index of 
refraction. (From Ref. 3). 
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nonabsorbing substrate is n 2 = 1.5, the value for plate glass. The oscillatory 
nature of the reflected light intensity, caused by interference effects, has a 
periodicity related to the film thickness and index of refraction. This is the 
basis for experimentally determining the thickness of transparent films if the 
index of refraction is known (see Chapter 6). Conversely, the optical proper¬ 
ties of the film can be determined at a particular wavelength if its thickness is 
known. Maxima or minima in the reflected intensity occur at specific film 
thicknesses, for given wavelengths, depending on whether the refractive index 
of the film is greater or less than that of the substrate. In the former case the 
reflectivity is enhanced, whereas in the latter case reflectivity is diminished. 
Optimization of these two effects has led to the development of dielectric 
mirrors and antireflection coatings, respectively. 

To quantify the issues related to antireflectivity, Eq. 11-17 reveals that r 
vanishes when r x + r 2 exp - id = 0 or when the denominator goes to infinity. 
The latter is an impossibility, since r l and r 2 are less than or equal to 1. The 
remaining condition can be decomposed into two real transcendental equations: 
(a) /* | + /* 2 cos 6 = 0 and (b) r 2 sin 6 = 0. Equation (b) implies that 6 = 0, 
±7T, ±2 tt, ±37r, etc., but the simultaneous satisfaction of equation (a) 
requires the selection of 6 = ±7r, ±37 t, ±57t, etc. Under these conditions, 
r \ “ r 2 = 0 or ("o “ /2,)/(/i 0 + *i) = («i - « 2 )/( /2 i + n i)' Therefore, 

n l = \/n 0 n 2 . (11-20) 

Since 6 = 4irn ] d x /X = 7r, 37r, Sir , etc, 

X 3X 5X 

n t d = — , — ,etc. (H-21) 

Equations 11-20 and 11-21 represent the amplitude and phase conditions for 
zero reflectance, respectively. In the design of a one-layer antireflection 
coating, the film index of refraction should be the geometric mean of the 
refractive indices of adjacent media. This is only strictly true for the wave¬ 
length X for which the optical thickness of the film is X/4, 3X/4, etc. 

To coat a glass lens (n 2 = 1.52), a film with n x = x/1.52 = 1.23 is 
optimal for antireflection purposes. Clearly, this is only one consideration 
among many, including availability, ease of deposition, hardness, and environ¬ 
mental stability, which must be taken into account when choosing the film 
layer. The most widely used AR coating is a X/4-thick film of MgF 2 with 
n x = 1.38. It can be used to coat either glass or acrylic substrates. In the 
absence of an AR coating, glass will exhibit a reflectance of ((1.0 — 
1.52)/(1.0 + 1.52)) 2 = 0.043. Suppose it is desired to reduce the reflectance 
at a wavelength of 5500 A. Then the film thickness required is X/4/ij or 
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(a) 



Figure 11-10. (a) Single (S) and double (D) layer antireflection coating characteris¬ 

tics. (b) Broadband antireflection coating characteristics. (From Ref. 5). 

© 

5500/4(1.38) = 996 A. Under these conditions the reflectance is given by Eq. 
11-18 with r x = (1.0 - 1.38)/(1.0 + 1.38) = -0.160, r 2 = (1.38 - 
1.52)/(1.38 + 1.52) = -0.0483, and 5 = tt. Substitution in Eq. 11-18 yields 
a value of R = 0.0126, indicating an almost fourfold decrease in reflectivity. 
Greater improvements occur for higher n 2 values of the underlying substrate. 
As an example, for an uncoated glass with n 2 = 1.75 the reflectance is 0 . 074 . 
With a quarter-wave-thick MgF 2 coating, R is reduced to 0 . 0025 . At other 
wavelengths, but the same optical thickness, R will be different because n ] 
varies with X (i.e., dispersion) and because of changes in 6. The reflectance 
reduction with a single-layer AR coating as a function of wavelength is shown 
in Fig. 11-10. 
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It is instructive to end the discussion with several observations made by 

Anders. (Ref. 3) 

1. There is a more rapid variation of 5 and hence R with X for a 3X/4 film 
than for a X/4 film. Therefore, R will be less dependent on wavelength 
with a X/4 coating. 

2. It is not always true that films of high refractive index give a high 
reflectance, whereas those with low refractive index yield AR coatings. The 
rule is that if the reflected amplitudes r x and r 2 are of the same sign, 
antireflection behavior is observed; if they are of opposite sign, then 
reflection from the surface is enhanced. 

3. For very large amplitude values of r x = -r 2 , R approaches 100% and the 
reflection becomes zero only in narrow wavelength bands at X/2, X, 
3X/2,. . . . This occurs physically when a film is sandwiched between two 
media of the same refractive index, i.e., cemented film (n 2 / n x /« 2 ), as 
shown in Fig. 11-9. 


11.3.3. Absorbing Films 

The mechanisms by which materials absorb radiation were treated earlier. 
Absorption effects can be formally incorporated into the Fresnel equations by 
replacing the refractive index n by the complex refractive index; i.e., N = n 
- ik. For the case of reflection due to normal incidence of light at an interface 
between nonabsorbing and absorbing media of refractive indices n Q and 
n x - ik x , respectively, 

n o ~ n \ + 

r = -. 

n 0 + n \ ~ ik\ 

By evaluating | r x \ 2 , we have the reflectance formula Eq. 11-6. 

As an example, consider the reflectance of A1 front surface mirrors pro¬ 
duced some 15 years apart. Hass (Ref. 15) measured the optical constants of 
A1 to be N A] = 0.76 - /5.5 in 1946 and N M - 0.81 - /5.99 in 1961. Substi¬ 
tution in Eq. 11-6 with n 0 = 1 yields respective reflectances of 0.909 and 
0.916. Improved deposition technology including higher and cleaner vacua, 
purer metal, and higher evaporation rates were probably the cause of the 
enhanced reflectance. An R value of 0.91 could be achieved with a hypotheti¬ 
cal absorption-free material with n ~ 43. This extremely high value can be 
thought of as the effective refractive index for aluminum. 

A frequently asked question regarding thin films is, how thick must a metal 
film (on a transparent substrate) be before it is continuous? By this is meant the 
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thickness at which it can no longer be seen through. A simple estimate can be 
obtained by arbitrarily assuming that a drop in transmitted intensity by a factor 
of 1 /e occurs when the film is continuous. Therefore, the use of Eq. 11-3 with 
7// 0 = \/e yields 4irkd /X = 1, or d = \/4irk. It is clear that the answer to 
the question not only depends on the type of metal but also on the wavelength 
of light used to view it. The critical thickness for A1 films at 5500 A is 82 A, 
whereas for Au films it is 185 A. It is common experience, however, that films 
that are considerably thicker exhibit some transparency. The reason is that 
ultrathin films condense in an island structure of discrete clusters rather than as 
planar, continuous, homogeneous layers assumed in the optical theory. An 
alternative approach to this problem, which is left to the reader as a lengthy but 
healthy exercise in the use of complex numbers, is to consider the optical 
structure n 0 /n l - ik x /n 2 corresponding to free-space/metal film/substrate. 
From Eq. 11-19, T can be calculated for different film thicknesses. 

By inverting the order of the last two optical components, i.e., n 0 /n x / n 2 
— ik 2 , we have the case of the back surface or protected mirror. It is 
commonly believed that the reflection properties of the mirror are unaffected 
by the protective layer. In reality, the latter actually reduces the reflectance in 
the visible, and, particularly, in the UV and IR ranges. 

The remaining case, n 0 /n { - ik x /n 2 - ik 2 , models the optical behavior 
of an absorbing film on an absorbing substrate. This structure was recently 
used to determine the real-time kinetics of regrowth of epitaxial Si into an 
amorphous Si (surface) layer. By bouncing a He-Ne laser beam off the surface 
and monitoring the reflected beam intensity, the instantaneous position of the 
epitaxial-amorphous interface could be unfolded from the attenuated periodic 
signal (Ref. 16). Such a measurement is possible because the optical constants 
of crystalline and amorphous Si differ. (See Problem 9, p. 543.) 


11.4. Multilayer Optical Film Applications 


11.4.1. Introduction 

Once the basic principles governing the applications of single dielectric films 
and their deposition methods were firmly established, extension to multilayer 
systems was naturally driven by several factors (Ref. 17): 

1. By suitable variations in design, it is possible to obtain improved AR 
properties over a broader spectral range. 

2. Systems with a vast variety of optical filtering properties can be achieved 
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Figure 11-11. Typical applications of thin films and film systems in optics. (From 
Ref. 5). 


usually with the use of many film layers (sometimes a dozen or more) but 
with only a very limited number of materials (e.g., MgF 2 and ZnS). 

3. Multilayer optical filters have advantages over other types of filters. The 
reason is that there is very little absorption loss in dielectric film layers, 
since they rely on the effects of interference. 

4. The principles of design of optical systems applicable to one region of the 
electromagnetic spectrum (e.g., visible) are also valid in other regions 
(e.g., UV and IR). 

Various types of thin-film optical component characteristics are shown in 
Fig 11-11 where the desired reflectance and transmittance properties are 
schematically indicated as a function of wavelength. 


11.4.2. AR Coatings 

Antireflection coatings constitute the overwhelming majority of all optical 
coatings produced. They are used on the lenses of virtually all optical 
equipment, including cameras, microscopes, binoculars, range finders, tele¬ 
scopes, and on opthalmic glasses. Because of the reflection at each air-glass 
interface, intolerably large light losses can rapidly mount in complex lens 
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systems. Neglecting absorption effects, the transmission of an optical system is 
given by 

7= (1 -/?,)(1 -* 2 )(1 . (11-23) 

where the R i are the reflectances (Eq. 11-5) at the individual optical inter¬ 
faces. For example, in a system with uncoated lenses consisting of 20 
interfaces, each with R = 0.05, the value of T = (0.95) 20 = 0.358. If, how¬ 
ever, R is reduced to 0.01 by means of AR coatings, then T = 0.818. The 
measured transmission is actually somewhat higher than these estimates be¬ 
cause light is backreflected at internal air-glass interfaces. The improvement is 
impressive indeed. In addition to enhancing light transmission, AR coatings 
reduce glare. The so-called veiling glare causes a reduction in image contrast 
by illuminating regions of the image that should normally be dark. Lastly, 
since lens surfaces fortuitously act as mirrors in addition to refractors, spurious 
ghost images are frequently generated. These are also reduced through the use 
of AR coatings. Other optical systems that derive benefit from the use of such 
coatings to maximize the capture of light include solar cells, infrared detectors, 
and magneto-optical devices. 

In the case of the double-layer coating where the indices of refraction vary 
successively as n Q (= 1 )/n x /n 2 jn 3 from free space to the substrate, the 
complex reflectivity amplitude is given by 

r, + r 2 e~ i8 ' + r,e /<fi ' +62> 4- r x r 2 r,e~ ib2 

r = — ! - TT --- .... T (11-24) 

1 + r i r 2 e"' s ' + /• | r 3 e~ ,<4 ' +42) + r 2 r 3 e~' 42 


by analogy with Eq. 11-17. For normal incidence the indicated r for each of 
the three interfaces is given by 


n = 


«/- 1 + n i 


and 


6 , 


47r«jCf, 47Trt 2 ^2 

-r~ ; —r - 


where d x and d 2 are the thicknesses of the coating layers. Zero reflectance at 
one wavelength will obtain when the condition n 2 = n { y/n^/ n 0 is fulfilled. 
Once film n x has been selected, this condition serves to specify the optimal 
value of n 2 . The improvement of a double-layer AR coating relative to the 
single-film coating is shown in Fig. 11-10a. Interestingly, although the double 
layer results in a considerable reflectance reduction at wavelengths centered 
about 5500 A, the response is worse at the spectral extremes due to the high 
curvature of the R vs. X dependence. Greater care is required in controlling 
the film thickness in bilayer coatings than in single layers. In the latter a film 
thickness error simply means that the reflectance minimum is shifted to another 
wavelength. In contrast, an error in double-layer thicknesses can not only 
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eliminate reflection minima but even increase reflectance. Multilayer film 
thicknesses must be even more stringently controlled. 

The extension of the analysis to a multilayer stack of dielectric films of 
various thicknesses and n values is straightforward, though cumbersome. 
Exact formulas exist for three and more layers. Modem broadband AR 
coatings generally consist of three to seven film layers. An example of 
the reflectance characteristics of such a multilayer coating is shown in 
Fig. 11-lOb. 


11.4.3. Multilayer Dielectric Stacks 

Since the high reflectance of a single X/4 film is due to the constructive 
interference of the beams reflected at both surfaces, the effect can be enhanced 
by phase agreement in the reflected beams from multiple film layers. What is 
required is a stack of alternating high (H) and low (L) index X/4 films. Next 
to the substrate is the usual high index layer so that the stacking order is 
HLHLHLHL.... For z layers it has been calculated that the maximum 
reflectance is given by (Refs. 4, 17) 


^X/4 “ 


/ /i* +l - 

\n^' + nl-'n 2 


2 


(11-25) 


where n H , n L , and n 2 are the high, low, and substrate indices. An expansion 
of Eq. 11-18 for n, > n 2 shows that the z layers are equivale nt to a single 
layer whose effective refractive index is equal to 1 ■ 

The spectral characteristics of such a multilayer stack are shown in Fig. 

11-12 for the case of a variable number of alternating layers of ZnS and MgF 2 . 
Also shown is a portion of the microstructure of a multifilm stack composed of 
these materials. It is clear that the magnitude of the reflectance increases with 
the number of layers. The number of sideband oscillations outside the high-re¬ 
flectance zone also increases with number of layers. The spectral width of the 
high reflectance zone is a function of the ratio of the refractive indices of the 
involved films, and there are a couple of practical ways to extend it. One is to 
select materials with n H and n L that are higher and lower, respectively, than 
those of ZnS and MgF 2 . Another is to broaden the basis of design to include 
several wavelengths. In such a case the dielectric stack would be composed of 
staggered layer thicknesses so that consecutive maxima would overlap. In this 
way 15 layers of ZnS and Na 3 AJF 6 with different optical thicknesses can be 
used to span the visible range. By similar methods dielectric mirrors are 
designed to operate in the infrared or ultraviolet with very small residual 
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Figure 11-12, (a) Spectral characteristics of multilayer stacks formed of alternating 

X/4 layers of ZnS and MgF 2 onglass(n 2 = 1.52) as a function of lirnd/X. Normally 
incident light with X = 4600 A assumed. Number of layers in each stack is indicated. 
(From Ref. 18). (b) Transmission electron micrograph of a replica of the ZnS/MgF 2 
multilayer cross section. (Courtesy of K. H. Guenther). 

absorption. In reducing the difference between n n and n L , a narrow-band 
reflection filter, the minus filter of Fig. 11-11, can be generated. 

Multilayer dielectric interference systems are ideally suited as reflection 
coatings for fully reflecting and partially transmitting laser mirrors. Negligible 
absorption means that reflectances of almost 100% can be achieved. Typical 
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material combinations have included ZnS-ThF 4 , Ti0 2 -Si0 2 , and other oxide 
combinations in either broad or narrow spectral-band mirror configurations. 
Much attention must be paid to substrates employed where low light scattering 
and good film adhesion are critical requirements. 


11.4.4. Cold Light and Heat Mirrors 

There are two noteworthy practical variants of dielectric mirrors—cold light 
and heat mirrors. The cold light mirror spectral characteristics are shown in 
Fig. 11-13. It has high reflectivity for visible light but a high transmission for 
IR radiation. These characteristics are particularly suited to motion picture or 
slide projectors in order to avoid overheating the photographic emulsion. 
Intense light sources (e.g., carbon arc, xenon lamps) emit IR radiation in 
addition to visible light and the heat generated by the former must be 
dissipated. A cold mirror is thus placed at 45° in front of the light source. The 
heating infrared radiation passes through it while the nonheating visible light 
reflects off to illuminate the object. Metals cannot be used because they are 
good reflectors of the IR. Interference films are required and these must have 
low absorption in the IR. In addition the first film on the glass should be 
material having high reflectance in the visible and transmitting in the IR (e.g., 
Ge or Si). A few alternating A/4 amplifying film layers on top of this help 
achieve the high reflectance over a suitably wide visible bandwidth. 

Heat or dark mirrors have characteristics that are inverse to those of cold 
mirrors (Fig. 11-14). There are two approaches to achieving high visual 
transmittance simultaneously with high IR reflectance. The first is to employ 


COLD MIRROR 



Figure 11-13. Spectral characteristics of a cold light mirror. (From Ref. 19 © 
Laurin Publishing Co. Inc.). 
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HOT MIRROR 



WAVELENGTH IN MICRONS 


Figure 11-14. Spectral characteristics of a heat or dark mirror. (From Ref. 19 © 
Laurin Publishing Co. Inc.). 


interference phenomena in an all-dielectric film stack. The second makes use 
of the properties of transparent conducting films. Consider the application to a 
low-pressure sodium vapor lamp, which consists of a Na-filled discharge tube 
within an evacuated glass envelope. For optimum Na pressure, the discharge 
tube must be kept at a temperature of about 260 °C. The necessary power for 
this is supplied by the gas discharge. However, the tube loses heat through 
radiation of energy in the far IR. Therefore, to conserve energy the inside of 
the envelope is coated so as to enable the (cold) yellow light to emerge while 
reflecting the IR back to the discharge tube. 

In another energy-saving application, home window panes coated with heat 
mirrors would reflect heat back into the house in the winter. In the summer the 
window could be reversed so that the coating could reflect the IR from the sun 
and help provide interior cooling. 


11.4.5. Photothermal Coatings 

The direct conversion of solar radiation into energy for heating or cooling 
applications is a vital component of energy supply and conservation strategies. 
Coatings play an important role in photothermal conversion, and it is appropri¬ 
ate to briefly consider them because of their outward resemblance to the above 
mirrors. They differ because the substrate is usually a heat-absorbing metal 
panel. In addition, they are designed for optimal response to the spectral 
characteristics of sunlight. The situation can be modeled by noting that 
A + R = 1, where A is the coating absorbance. Strong absorption of sunlight 
in the range of 0.3-2.0 fxm is required to heat the substrate. However, a 
portion of the heat will be lost by reradiation from the surface, reducing the 
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overall conversion efficiency. Therefore, a second requirement of the coating 
surface is a low emittance or high reflectivity in the spectrai region of 
reradiation—2-10 fim. Emittance € is defined by the ratio of power emitted 
by a given surface to that of a blackbody. Clearly, higher values of A / e result 
in desired higher equilibrium temperatures reached by the coating. (The 
similarity to the radiation limited temperature reached during sputtering should 
be noted. See p. 117.) 

Solar absorbing coatings have been produced by physical and chemical 
vapor deposition techniques as well as by electroplating, anodization, acid 
dipping, painting, and spraying. Compositions include NiS-ZnS (black Ni), 
Cr-Cr oxide (black Cr), Al 2 0 3 -metal, SiO-metal, PbS, and Zn to name a 
few. Typical absorptances range from 0.90 to 0.98, and emittances of 0.1 are 
common. 


11.4.6. Optical Filters 

Filters are optical components that selectively change either the intensity or 
spectral distribution of light emitted by a source. They can be designed to 
change spectral characteristics over the total, a substantial fraction of the total, 
or only over an extremely narrow portion of the total wavelength range. 
Respective examples of these are shown in Fig. 11-11 and include 

1. Neutral or gray filters, which reduce the light intensity equally for all 
wavelengths. 

2. Broadband, short- or long-wave pass filters. The cold light and heat mirrors 
just described are specific examples. 

3. Narrow bandpass or monochromatic filters. 

Thin film coatings to achieve these ends consist of thin metal films, 
dielectric films, multilayer metal and dielectric film combinations, and all 
dielectric film stacks. These can be deposited on clear and colored glass 
substrates to produce the desired effects. In the very broadest usage of the 
term, filters can be thought to include mirrors and antireflection coatings but 
these optical devices are usually considered separate categories. Since the 
subject is a large one, discussion will be limited. 


11.4.6.1. Neutral Filters. Neutral density filters consist of single metallic 
films of varying thicknesses on glass. They produce the desired uniform 
attenuation of light by reflection and absorption effects. Metals such as Cr, Pd, 
Rh, and Ni-Cr alloys are used for this purpose. The filter is usually character- 
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ized by its optical density, which is defined by log I/I 0 (see Eq. 11-3). 
Important applications of neutral filters can be found in spectroscopy equip¬ 
ment, color photography, and microscopy. They can be fabricated to span the 
visible as well as IR and UV portions of the spectrum. 


11.4.6.2. Broadband Filters. Low- and high-pass edge filters fall into the 
category of broadband filters. They are characterized by an abrupt change 
between a region of high transmission and a region where light is rejected. 
Such an edge band filter is shown in Fig. 11-15, and is used to block out UV 
radiation from a mercury light source. Similar filters can create distinctions in 
light transmission and rejection between the visible and IR and well as across a 
narrow wavelength range entirely within the visible, IR or UV. Filters 
manufactured for the near-IR and visible employ Ag films, whereas A1 is used 
for those operating in the UV. These metals are coated with dielectrics such as 
MgF 2 , PbF 2 , cryolite, and ThF 4 . In the IR, Ge, Si, and Te layers find 
common use. All dielectric multilayer mirror systems can also be used as the 
basis for the design of edge filters, particularly those that require a sharp 
transmission between the pass and stop portions of the transmittance curves. 
The way to sharpen the transition is to increase the number of layers in the 
stack. Unfortunately, the amplitude and frequency of the sideband oscillations 
in the passband also increase when this is done. Suppression of these oscilla¬ 
tions or 4 ‘ripple” is one of the major concerns of filter designers. 

Other common broadband filters consist of colored absorbing glasses in 
combination with interference edge filters or a pair of interference edge filters. 
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Figure 11-15. Mercury lamp light source spectrum and UV blocking filter character¬ 
istics. (From Ref. 5). 
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The latter can be made to have the inverse characteristics of the all-dielectric 
mirror stack—i.e., with high transmission instead of high reflectance, and vice 
versa. Wide ranges of the visible or IR can be selectively filtered this way. 

There are many applications of wide-band filters in color photography, TV 
cameras, color separation schemes, studio illumination, microscopy, etc. We 
close with an additional pair of applications. The first involves using a filter to 
minimize heating of Si solar cells by eliminating the IR component from 
sunlight. Electron-hole pairs are only generated for wavelengths less than 1 
tim and the cell is more efficient when cool. An edge filter with a cutoff 
beyond this wavelength would be called for. Such a filter can be combined 
with an antireflection coating to optimize efficiency. 

A second interesting example involves filters employed in fluorescence 
microscopy (Ref. 5). Sometimes the excitation and emission wavelength bands 
used are so closely spaced that, unless precautions are taken, the two overlap, 
resulting in swamping of the fluorescent light output by the strong source light. 
This happens for example with FTIC, a fluorochrome employed in immunoflu¬ 
orescence. For excitation, maximum absorption occurs at 0.490 ^tm, and the 
emission maximum occurs at 0.520-0.525 pm. An edge filter with an exceed¬ 
ingly high steepness at - 0.500 pm is required. A filter with no less than 31 
Ti0 2 -Si0 2 layers is required to suppress unwanted source radiation to levels 
of — 0.1 % in the region where excitation occurs. 


11.4.6.3. Narrow-Band Fillers. These filters can be traced back to the use 
of the Fabry-Perot interferometer. The optical arrangement involved consists 
of two parallel facing, partially transmitting silver film mirrors separated by an 
air or dielectric layer. Light incident normally on this pair of mirrors is 
strongly transmitted only in a very narrow spectral range. This is a very 
surprising result, since one would expect the mirrors to reflect and filter the 
light; what little light the first allowed to be transmitted would be reflected 
back by the second mirror so that none would get through. This does not 
happen, however. Assume, for example, that the mirrors transmit 2% of the 
light and that 1 W of monochromatic light is incident. If the distance or cavity 
between mirrors is not an integral number of wavelengths long, the light waves 
that penetrate the first mirror will bounce to and fro and soon be out of phase. 
Of the 0.02 W incident on the second mirror, 0.0004 W will eventually be 
transmitted. If the cavity is, however, resonant, all waves will be in phase and 
their amplitudes will add, so that perhaps 50 W will circulate between the 
mirrors. Then 2%, or approximately 1 W, will be transmitted. This effect is 
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relied upon in laser operation. The transmission maxima occur for \ = 2nd, 
where nd is the effective optical thickness of the spacer layer. 

Narrow bandpass filters can be fabricated in virtually any region of the 
spectrum. Figure 11-9 gives us a clue as to what is required. As the refractive 
index of the deposited interference film increases, not only does the reflectance 
increase at X/4 but the region of high transmittance at X/2 narrows consider¬ 
ably. The case where r, = — r 2 = -0.98 combines the high transmittance 
over a narrow range. Two conditions must be fulfilled to achieve this. First the 
optical structure must be symmetric about the spacer layer so that \r x \ = \ r 2 1. 
Second, high reflectance is required at each layer-substrate interface. Metal 
film mirrors can accomplish this but at the expense of some absorption losses. 
A desirable alternative when low loss is essential is to employ an all-dielectric 
film stack. The role of the stratified dielectric structure is to increase the 
reflectance by essentially raising the effective index of refraction as noted 
earlier. 


11.4.7. Conclusion 

In virtually all of the applications in this chapter the individual dielectric films 
have traditionally been modeled solely in terms of two parameters—thickness 
and refractive index. This simple approach will be inadequate in the future 
because of the steadily increasing performance requirements of advanced 
precision optical systems. The gap between theoretically predicted characteris¬ 
tics and performance attained in practice can be narrowed only by modifying 
the basic theory to include second-order effects. These include 

1. Dispersion or the variation of refractive index with wavelength 

2. Small amounts of absorption 

3. Inhomogeneities resulting in the variation of refractive index throughout 
single films 

4. Anisotropy in the refractive index with direction of radiation 

5. Departures from perfectly planar boundaries 

Concurrently, great strides have been made in improving the quality of 
optical materials and in controlling deposition processes. Likewise, characteri¬ 
zation techniques have reached such high degrees of precision that measure¬ 
ments have exposed weaknesses in the theory and design of multilayer film 
systems. Computer-aided interactive feedback integrating theory, design, pro¬ 
cessing and performance of multilayer coatings is essential. In these ways, 
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experience and art, which have so long and so well served the optical coating 
field, are being supplanted by more exact scientific approaches. 



Exercises 



1. Schematically sketch the optical absorption of two semiconductor films as 
a function of wavelength if one film is doped more heavily than the other. 
Is there a difference in absorption at the wavelength corresponding 

to v 

2. a. If the index of refraction of a GalnAsP semiconductor laser is = 

3.52, what is the reflectance at the air interface? 
b. To reduce R at the laser exit window a single-layer AR coating is 
required. What index of refraction and film thickness would you 
recommend for a 1.3-/*m device? 

3- Prove that 

a. without AR coatings, surfaces of higher refracting glasses produce 
higher values of R than those of lower refracting glasses. 

b. higher refracting glasses increase the effectiveness of a single X/4 AR 
layer. 

4. Compare the spectral response of the single AR layer ( n 0 = 1 /«, = 
1.38(X/4)/w 2 = 1.52) and the two-layer AR coating 

K = 1 /«, = 1.38(\/4)/ n 2 = 1.70(X/4)/« 3 = 1.52) 

by calculating R at X = 500, 550, and 600 nm for each. [Note: The X/4 
layers are selected for X = 550 nm, and n is assumed to be independent 
of X.] 

5. A 7.5-cm-long glass slide substrate of index of refraction n 2 = 1.5 is 
coated with ZnS for which n l = 2.3. Graph the expected percent re¬ 
flectance at 0.55 fxm as a function of position along the slide if 

D 

a. a uniform 2000-A film is deposited. 

b. a wedge-shaped film (zero thickness at one end, 2000-A thick 10 cm 
away) is deposited. 

c. an evaporated film is deposited from a surface source 10 cm directly 

o 

below the center of the slide. The maximum film thickness is 2000 A. 
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6- For an additional charge lenses on eyeglasses are coated. How does this 
enhance wearer personal appearance and vision or extend lens life? 

7. A protected A1 mirror is characterized by n 0 /n { / n 2 - ik 2 , with n 0 = U 
n x = 1.52, n 2 = 0.81, and fc 2 = 5.99 at X = 0.55 ^m. 

a. Calculate r x , r 2 , and R. 

b. What is the reflectance of the mirror? 

c. How does R for an unprotected mirror compare with the answer to 
part (b)? 

8- A step gauge consisting of thermal Si0 2 films on a Si substrate, varying 
in thickness from 200 to 5000 A is viewed with a HeNe laser (X = 6328 

o 

A). If the refractive index of Si is N = 4.16 - /0.018, plot the re¬ 
flectance versus Si0 2 film thickness. 

9. A thin amorphous Si (a-Si) film (n a - ik a ) on a (100) Si (c-Si) substrate 
(n c - ik c ) shrinks in thickness during solid-phase epitaxial regrowth at 

o 

elevated temperature. A He-Ne laser (X = 6328 A) probe beam reflects 
from both the surface and a-c interface establishing interference effects in 
the backscattered optical signal. 

a. Show that the reflectivity for any given a-Si film thickness, d> is given 
by 

f ^ f g-*'*k a df\g-i4Trn a dl\ ~ 2 

ft __ J_?_ 

1 + r l r 2 e~ Mad e~ i4lrnad/x 

b. If N a = 4.85 - /0.61 and N c — 4.16 - /0.018, calculate i? as a 

o 

function of d over the range 4000 to 0 A. Characterize the resultant 
reflectivity oscillations. [Note: When d > 4000 A, there is essentially 
no contribution from c-Si and R = 0.438. At d = 0, R = 0.375.] 

c. Suppose the layers are reversed and a film of c-Si is at the surface on 
top of a thicker a-Si substrate layer beneath. How does the R vs. 
tf(c-Si) dependence differ from the R vs. <i(a-Si) dependence of the 
previous case? 

10. Ion bombardment deposition of a (HLH, etc.) multilayer dielectric stack 
of nine layers of ZnS and MgF 2 on glass (n 2 = 1.52) raises each of the 
respective refractive indices by 4%. What change in reflectivity can be 
expected for such a structure relative to a traditionally evaporated stack? 
What if there were only five layers? 
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11. The inner surface of an incandescent lamp bulb is coated with a thin-film 
sandwich consisting of 

ZnS (0.03 jim thick) 

Ag (0.02 jim thick) 

ZnS (0.03 jim thick) 

Explain the function of these layers and the overall behavior of the lamp. 

12. Explain why the thin-film coating consisting of air/SiO(X/4)/Ge(X/4)/ 
opaque Al/glass substrate has a reflectance-wavelength response as fol¬ 
lows: 


Wavelength 

%R 

Visible 

- 2 

1 /im 

80 

1.2 n m 

90 

> 2 

> 95 


13. Calculate R for the following dielectric stacks. 


System 

No. of Layers 

n Substrate 

H.L. 


SH 

1 

1 

Ce0 2 ,Na 3 AlF 6 

0.55 

SHLH 

3 

1.52 

Ce0 2 ,Na 3 AlF 6 

0.55 

SHLHLH 

5 

1.52 

Ce0 2 ,Na 3 AlF 6 

0.55 

SHLHLH 

5 

1.45 

Ge, Na 3 AlF 6 

2.0 

SHLHLHLH 

7 

1.50 

ZnS, Na 3 AlF 6 

0.59 
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Chapter 12 




Metallurgical and 
Protective Coatings 


12-1. Introduction 


Paralleling the dramatic development of thin-film technology in microelectron¬ 
ics have been the no less than remarkable advances in what may be conve¬ 
niently called metallurgical and protective coatings. The unusual materials 
which comprise these coatings are drawn from several classes of solids and 
include ionic ceramic oxides (e.g., A1 2 0 3 , Zr0 2 , Ti0 2 ), covalent materials 
(e.g., SiC, BC, diamond), transition metal compounds (e.g., TiC, TiN, WC) 
and metal alloys (e.g., CoCrAlY, NiAl, NiCrBSi). As a whole they are 
characterized by extremely high hardness, very high melting points, and 
resistance to chemical attack, attributes that have earmarked their use in critical 
applications where one or more of these properties is required; correspond¬ 
ingly the respective categories of hard, thermal, and protective coatings denote 
the functions to which they are put. Hard coatings of TiN and TiC, for 
example, are used to extend the life of cutting tools, dies, punches, and in 
applications such as ball bearings to minimize wear. The collection of coated 
cutting tools and dies shown in Fig. 12-1 is representative of the widespread 
commercial use of this technology in machining and forming operations. 
Thermal coatings find extensive use in gas turbine engines where they help to 
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Figure 12-1- (Left) Assorted cutting and forming tools coated with TiN and multi¬ 
layer coatings. (Courtesy Multi-Arc Scientific Coatings). (Top right) HSS forming and 
sheet metal dies coated with TiN and TiC. (Courtesy Ti Coating Inc.) (Lower right) 
multilayer coated cutting tool inserts. (Courtesy of S. Wertheimer, ISCAR Ltd.) 


improve the performance and extend the life of compressor and turbine 
components. As the name implies, protective coatings are intended to defend 
the underlying materials, usually metals, from harsh gaseous or aqueous 
environments that cause corrosive attack. Such coatings have found applica¬ 
tions in chemical and petroleum industries, coal gasification plants, as well as 
in nuclear reactors. 

Employing coatings represents a significant departure from traditional engi¬ 
neering design and manufacturing practices. Processing components beyond 
the primary manufacturing steps of casting, forging, extrusion, machining and 
grinding, pressing and sintering, etc., has generally been resisted. This is due 
in part to a reluctance to tamper with the product, and to leave well enough 
alone. A compelling case was not made for the cost effectiveness of additional 
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treatments. However, more recently several important factors have combined 
to firmly establish the practice of modifying the surface properties of engineer¬ 
ing materials and components. 

1. In many critical applications the design specifications call for properties that 
are simply beyond the capabilities of the commonly available and routinely 
processed materials. The new limits of behavior demanded can be met by 
the use of the unusually hard, temperature- and degradation resistant 
materials noted earlier. However, these materials are extremely difficult to 
fabricate in bulk form. 

2. Concerns of limited availability of strategic materials, the thrust toward 
energy efficiency and independence, and an increasingly competitive world 
economy have exerted a strong impetus to considerably tighten engineering 
design, improve performance, and economize on materials utilization. 

3. High-quality coatings possessing fewer surface imperfections than compara¬ 
ble pressed and sintered bulk parts made from powder, can now be 
reproducibly deposited. This is due to the advances made in our basic 
understanding of the deposition processes and the development of improved 
coating and deposition techniques. 

4. The commercial availability of the necessary deposition chambers or reac¬ 
tors, hardware, computer-controlled processing equipment, and high-purity 
sources of precursor gases, powders and sputtering targets has facilitated 
the option of employing coatings. 

Various combinations of the above factors have then resulted in the marriage 
of coatings to the underlying base materials, each with their particular set of 
desirable and complementary properties. For example, many structural materi¬ 
als with adequate high-temperature mechanical properties simply do not have 
the ability to withstand high-temperature oxidation, corrosion, particle erosion, 
and wear. On the other hand, the materials that do possess the environmental 
resistance either do not quality as structural materials because of low toughness 
or, if they do, are prohibitively expensive to fashion in bulk form. 

Before we turn to the main subjects of the chapter, it is worth noting some of 
the similarities and differences between the present mechanically and environ¬ 
mentally functional coatings, and the thin films of prior book chapters. In 
common, many coatings are deposited by the same type of physical (PVD) and 
chemical (CVD) vapor deposition techniques. Adhesion to the substrate, 
development of desirable structure and properties, and meeting performance 
standards are universal concerns. Among the differences are the following: 

1. The coatings we will be considering are far thicker than thin films. Whereas 
a couple of microns, at most, is the arbitrary upper limit to what we have 
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called films, coatings typically range from several to tens and even hun¬ 
dreds of microns in thickness, 

2. The maintenance of precise coating thickness and uniformity is not usually 
a major concern. There is generally a broad range of acceptable coating 
thicknesses. This is in contrast to the critical thickness tolerances and 
uniform coverage that must be achieved in optical and microelectronic 
films. 

3. The substrate is frequently an integral part of the coating system. In 
diffusion coatings, for example, metalloid as well as metal elements are 
diffused into the substrate, creating thick, solute-enriched layers beneath the 
surface. 

4. For the most part, the substrates employed for hard and protective coatings 
are rather special metals and alloys such as tool, high-speed, and stainless 
steels; iron-, cobalt-, and nickel-base superalloys; sintered tungsten carbide; 
titanium, etc. The use of the term metallurgical coating is based in part on 
this fact. 

5. There are many methods for producing coatings. In addition to the vapor 
phase atomistic deposition processes (PVD, CVD) for films, coatings are 
also formed by 

a. Deposition of particulates (e.g., by thermal spraying of metal or oxide 
powders either through a hot flame or an even hotter plasma) 

b. Immersion of substrates in molten baths or heated solid packs 

c. Electrolytic processes such as electroplating, fused salt electrolysis, and 
electroless plating 

d. Miscellaneous processes, e.g., welding and enameling 

6. Except for epitaxial semiconductor films, most thin-film depositions are 
carried out at relatively low-substrate temperatures. Metallurgical and 
protective coatings, however, are frequently deposited at elevated tem¬ 
peratures. Certainly this is true of the CVD coatings, and, therefore, 
atomic interdiffusion and reactions generally occur at the interface 
between coating and substrate. Compositional change can either be 
beneficial or detrimental to adhesion and coating properties depending 
on the materials involved. 

The bulk of the chapter will be concerned with hard coatings and issues 
related to them. Somewhat lesser emphasis is placed on thermal and environ¬ 
mental coatings. To limit the treatment to manageable proportions, we deal 
with properties and the phenomena they influence in a fundamental way. The 
more widely used vapor deposition processes will be primarily discussed to 



12.2 Hard Coating Materials 


551 


maintain a consistency with prior chapters. Electrodeposition, for example, 
will not be mentioned again, since there is already a huge and accessible 
literature on the subject. Case histories and examples are always interesting 
and will be interspersed where appropriate. The specific topical outline of the 
rest of the chapter is 

12.2. Hard Coating Materials 

12.3. Hardness and Fracture 

12.4. Tribology of Films and Coatings 

12.5. Diffusional, Protective, and Thermal Coatings 


12.2. Hard Coating Materials 


12.2.1. Compounds and Properties 

Hard coating materials can be divided into three categories, depending on the 
nature of the bonding. The first includes the ionic hard oxides of Al, Zr, Ti, 
etc. Next are the covalent hard materials exemplified by the borides, carbides, 
and nitrides of Al, Si, and B, as well as diamond. (See Section 14.2.) Finally, 
there are the metallic hard compounds consisting of the transition metal 
borides, carbides, and nitrides. Typical mechanical and thermal property 
values for important representatives of these three groups of hard materials are 
listed in Table 12-1. The reader should be aware that these data were gathered 
from many sources (Refs. 1-6) and that there is wide scatter in virtually all 
reported property values. Differences in processing (e.g., CVD, PVD, and 
sintering of powders), variations in structure (e.g., grain size, porosity, 
density, defects) and composition (e.g., metal-nonmetal ratio, purity), to¬ 
gether with statistical error in measurement, contribute to the uncertainties. 
Perusal of this tabulated information leads to the following broad conclusions: 

1. All of these compounds have extremely high hardnesses. This can be 
appreciated by noting that heat-treated tool steel has a hardness of about 
H v = 850. Hardness is the most often quoted material property of hard 
coatings. Therefore, Section 12.3 has been specially reserved for an 
extensive discussion of the concept of hardness, the technique of measure¬ 
ment, and the significance of its magnitude in coatings. 

2. These compounds have very high melting points and decomposition temper¬ 
atures. For example, the decomposition temperatures of TaC, HfC, and 
diamond exceed the melting point for tungsten (MP = 3410 °C). 

3. The modulus of elasticity is lowest for the ionic solids. In comparison, only 
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the stiffest metals have modulus values overlapping those of the oxides 
listed. 

4. The linear thermal expansion coefficient generally increases in going from 
the covalent to metallic to ionic hard compounds. Metals tend to have 
thermal expansion coefficients that are higher by approximately a factor of 
two or more than these hard compounds. 

5. The thermal conductivity of the hard metallic and covalent compounds is 
comparable to that of the transition metals and their alloys. Good metallic 
electrical conductors have proportionally higher thermal conductivities. 
Ceramic oxides are the poorest thermal conductors. 

The last two properties have important implications for the properties and 
performance of coatings. An important source of coating residual stress is the 
thermal contribution generated by the difference in expansion between coating 
and substrate. The illustrative problem dealing with TiC on steel (p. 419) is 
worth reviewing and indicates the magnitude of possible effects. 

The susceptability to cracking of coatings subjected to varying temperature 
histories is an important limitation to the performance of thermal coatings. To 
see how thermal cracking can occur, consider the rapid cooling of a high-tem¬ 
perature component. The surface coating contracts more than the interior, 
which is still relatively hot. As a result, the surface forces the interior into 
compression and is itself stretched in tension. The coatings we have been 
discussing are weak in tension and prone to fracture at incipient crack flaws. A 
measure of this fracture susceptibility is given by the thermal shock parameter 
S T . This composite quantity is derivable from heat transfer and thermal stress 
considerations that are defined respectively by the equations 

Q= -k(AT/Ax) (12-1) 

and 

a = (EAa AT )/(I - p). (12-2) 

In the first equation the heat flux Q is given by the product of the thermal 
conductivity k and the temperature gradient. The second equation was consid¬ 
ered previously (Eq. 9-17), where Ac* is the difference in thermal expansion 
coefficient between coating and substrate. Elimination of AT yields 

a EAa 

5-1 _ _ = _ 

T QAx (1 - v)k ' 

Therefore, for a given thermal flux through a coating of thickness Ax, the 
stress that develops is inversely proportional to the thermal shock parameter. 
Ceramic oxides with the lowest S T values are most prone to fracture by 


(12-3) 
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thermal shock, and metals with the largest S T values are least susceptible. The 
hard metal and covalent compounds are intermediate in this regard. 


12.2.2. Deposition Processes (Ref. 7) 

Hard coatings have now been commercially deposited on perhaps hundreds of 
millions of cemented tungsten carbide cutting tools by CVD methods since the 
mid-1960s. However, advances in physical vapor deposition methods in the 
past decade have led to the fruitful competition in many applications between 
these two broad deposition processes. It is instructive to briefly review and 
compare several aspects of these processes as they relate to hard coatings. The 
more extensive discussion of CVD and PVD in Chapters 3 and 4 provide the 
appropriate background. 

Since hard coatings are deposited on varied substrates (e.g., carbides, 
high-speed tool steels, ball-bearing steels, watch cases, jewelry, etc.), a range 
of substrate temperatures is involved, necessitating different deposition tech¬ 
niques and conditions. For example, TiN can be deposited by CVD over a 
broad range of temperatures using the same TiCl 4 precursor. Some typical 
reactions leading to the attractive, lustrous golden TiN deposits are 

1. High temperature: 1200 °C > T > 850 °C 

2TiCl 4 + N 2 + 4H 2 -* 2TiN + 8HC1. 

2. Moderate temperature: 850 °C > T > 700 °C 

TiCl 4 4- CH 3 CN 4- (5/2)H 2 -+ Ti(C,N) 4- CH 4 + 4HC1. 

3. Low temperature: 700 °C > T > 300 °C 

2TiCl 4 4- 2N 2 + 7H 2 4- Ar plasma 2TiN 4- 8HC1 4- 2NH 3 . 

Analagous reactions involving gaseous carbon compounds, such as CH 4 result 
in TiC deposits. Precursor gases and operating temperatures for these as well 
as other coatings have been given previously in Chapter 4. 

One of the important advantages of CVD methods is the ability to batch-coat 
large numbers of small tools at one time. The schematic of such a commercial 
system capable of individual or sequential TiC, TiN, and A1 2 0 3 depositions is 
shown in Fig. 12-2a. Typical gas feed proportions for these materials are 

TiC-TiCl 4 (1-2%); CH 4 (3-6%); H 2 (96-92%) 

TiN-TiCl 4 (1-2%); N 2 (22-26%); H 2 (77-72%) 

A1 2 0 3 -A1C1 3 (0.18-0.3%); CO (0.2-0.5%); H 2 (99%) 
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Figure 12-2. (a) Schematic view of commercial CVD reactor for deposition of TiC, 

TiN and A1 2 0 3 on carbide cutting tools, (b) Cathodic-arc process for reactive evapora¬ 
tion of TiN coatings. (Courtesy of A. Gates, Multi-Arc Scientific Coatings Inc.) 
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Temperatures within the range 700-1100 °C are maintained to ±2 °C. As 
many as 10-20,000 WC tool inserts (1/2 in square) can be simultaneously 
given a — 6-^m-thick coating during a several-hour treatment cycle. The 
fracture surface morphologies of single-layer CVD TiC, TiN and TiCN as well 
as HfN coatings deposited on cemented carbide substrates are shown in Fig. 
12-3. Rough transverse fracture surfaces containing columnar-like grains rising 
from a sharply defined substrate interface, typically characterize these coating 
structures. In the coating plane a mountainous relief of jagged crystals is 
evident. 

The three basic PVD coating processes are evaporation, sputtering, and ion 
plating together with reactive variants incorporating chemical reactions (Refs. 



Figure 12-3. Morphology of representative CVD coatings viewed in the SEM. 
Top-Transverse fracture surfaces of TiC and TiCN. Middle-Transverse fracture sur¬ 
faces of TiN and HfN. Bottom-Lateral surface structures of TiN and HfN. Substrate is 
cemented carbide. (Courtesy of D. T. Quinto, Kennametal, Inc., from Ref. 8). 
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8, 9). An important advantage of PVD is that substrate temperatures in the 
range of 200-600 °C, rather than 800-1200 °C for CVD, are utilized. This 
leads to somewhat finer coating grain sizes possessing slightly higher hard¬ 
nesses. Because of the high deposition temperatures in CVD, substrate steels 
are sometimes softened, requiring subsequent hardening and tempering treat¬ 
ments. Undesirable part distortion occasionally results from this sequence of 
thermal treatments. In addition, substrate elements such as iron, chromium, 
and carbon diffuse into the coating altering adhesion as well as substrate 
integrity. These difficulties do not arise in the low-temperature PVD coatings. 
An added advantage of commercial PVD processes is the generally higher 
deposition rates of 4-6 jxm/hour relative to typical CVD rates of 1-2 
/xm/hour. Nevertheless, throughput of small parts is substantially greater for 
CVD because substrate coverage is not limited by line-of-sight gas impinge¬ 
ment as in PVD. On the other hand, the hazard of handling toxic and corrosive 
gases is considered a disadvantage of CVD processes. For larger tools such as 
drills and milling cutters, the PVD techniques of reactive evaporation and 
sputtering are widely employed. Commercially, uniform coating of tools and 
dies is accomplished through multiple evaporators arranged to surround the 
work as shown in Fig. 12-2b. While single (e.g., TiC), double (e.g., TiC-TiN) 



TiN-AE 




Figure 12-4. Morphology of PVD coatings of TiN prepared by ion plating (IP) and 
arc evaporation (AE) viewed in the SEM. Top-Transverse fracture surfaces. 
Bottom-Lateral surface structures. Substrate is cemented carbide. (Courtesy of D. T. 
Quinto, Kennametal, Inc., from Ref. 8). 


558 


Metallurgical and Protective Coatings 


and even triple (e.g., TiC-Al 2 0 3 -TiN) hard layers are commonly produced 
by CVD, commercial PVD coatings are largely limited to single TiN layers. 
Transverse fracture and lateral surface morphologies of TiN prepared by two 
different PVD processes are reproduced in Fig. 12-4. The smooth fracture 
surfaces composed of densely packed, fine columnar grains should be com¬ 
pared with the microstructures of CVD TiN (Fig. 12-3). Differences in grain 
morphology can be attributed to varying conditions of deposition temperature 
and energetic bombardment by vapor phase atoms. 


12.2.3. Alloy Compounds (Refs. 3, 10) 

Just as metallic alloys display properties superior to each of the component 
elements, so there has been considerable interest in improving the properties of 
the transition metal compounds through alloying. Fortunately, this is possible 
because the binary carbides and nitrides tend to be miscible in the solid state. 
For example, at elevated temperature, TiC reacts with VC, ZrC, NbC, HfC, 
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Figure 12-5. Microhardness of mixed carbides due to solid solution and precipitation 
hardening (From Ref. 3). 
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and TaC to produce mixed carbide solid solutions, such as (Ti, V)C, (Ti, 
Zr)C, (Ti, Nb)C. Since these carbides have the same crystal structures, 
TiC-MC (M = V, Zr, etc.) phase diagrams resemble the one for Ge-Si 
shown in Fig. 1-12, at least at high temperatures. Similarly, TiN can be 
alloyed with VN, ZrN, NbN, and HfN to produce solid solutions of mixed 
nitrides. The increase in hardness for the mixed carbides can be quite exten¬ 
sive, as shown in Fig. 12-5. Substitution on the metal sublattice results in 
maximum hardening at specific alloy concentrations. Similar increases in 
hardness have been observed in the mixed metal nitride solid solutions as well 
as in metal carbonitrides, e.g., Ti(C, N), Hf(C, N). Property benefits also 
extend to oxide alloys. For example, aluminum oxynitride has greater chemical 
and thermal stability than A1 2 0 3 alone and is used to coat tools. 

Quaternary alloys arising from mixtures of carbides and nitrides of different 
metals (e.g., Ti-V-C-N) have also been investigated and appear to provide a 
promising basis for the selection of yet other new composite coating materials. 
Subsequent heat treatment of the alloyed carbides, nitrides, and carbonitrides 
can lead to complex low-temperature phase transformations and precipitation 
reactions. Dispersions of hard phases such as TiB 2 or B 4 C in TiC or TiN 
matrices have produced coatings that show better wear resistance and adhesion 
as well as higher toughness than single-phase layers. 


12.2.4. Multilayer Coatings (Refs. 3, 11) 

Distinct from multicomponent single layers is the multilayer concept. 
Several strategies have been adopted in creating composite multilayers: 

1. Mutually soluble layers, e.g., TiC-TiN or A1 2 0 3 -A1N, are sequentially 
deposited. Alloys with enhanced properties are produced this way. 

2. Compounds possessing coherent interfaces such as TiC-TiB 2 or TiN-TiB 2 
are sequentially deposited. Adhesion between layers is promoted by low 
energy bonding between common elements. 

3. Separate layers of largely noninteracting compounds are sequentially de¬ 
posited. Each layer has a specific purpose, such as interfacial adhesion, 
diffusion barrier, abrasion resistance, etc. 

The last approach is one that has attained considerable commercial success. 
Cross sections of such coatings on WC inserts for assorted cutting tools are 
shown in Fig. 12-6. They contain some combination of TiC, TiN, and A1 2 0 3 
layers deposited by CVD and are typically 6-9 ^m thick. The multilayer of 
Fig. 12-6a is tough and suitable for rough, semifinishing, and interrupted 
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Substrate 


S.E.M. x5500 



Substrate 


S.E.M. x3500 

Figure 12-6. SEM images of CVD multilayer coatings for cutting tool inserts, (a) 
Carbide substrate/TiC/TiCN/TiN (5500 x ). (b) Carbide substrate-TiC-Al 2 0 3 -TiN 
(3500 x ). (Courtesy of S. Wertheimer, ISCAR Ltd.) 

machining of plain carbon, alloy, tool, and stainless steels. The multilayer of 
Fig. 12-6b containing A1 2 0 3 has many of the same characteristics but is 
capable of cutting at higher speeds without tool failure. Because of its low 
thermal conductivity, A1 2 0 3 shields the cemented carbide substrate and en¬ 
ables the generated heat to be diverted to the chip instead. Among the reasons 
for the use of TiN as the outer coating layer are its high lubricity, chemical 
stability, and the lack of adherence of abrasive oxide particles to it. As 
indicated in Section 12.4.3 tools coated with multilayers suffer considerably 
less wear than tools with only single-layer TiC or TiN coatings. 
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12.3. Hardness and Fracture 


12.3.1. Hardness 

Hardness is an important material property of concern in films utilized for 
electronic and optical as well as mechanically functional applications. It affects 
wear resistance and plays an important role in the friction and lubrication of 
surface films in contact. Hardness, a complex property related to the strength 
of interatomic forces, apparently depends on more than one variable. Hard 
materials are generally modeled by a deep potential energy well (Fig. l-8b) 
with steep walls. These characteristics imply a combination of high cohesive 
energy and short bond length. Measures of the cohesive energy are the heat of 
sublimation and enthalpy of compound formation (A// 298 ). If some account is 
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Figure 12-7. Relation between hardness and cohesive energy per molar volume 
expressed in terms of heat of sublimation (O-Left axis) and heat of formation (e-Right 
axis). 
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taken of the bond length by dividing the cohesive energy by the molar volume 
of the material, then the correlations with hardness shown in Fig. 12-7 result. 
It is apparent that the hardest materials are covalently bonded, and that 
increasing the ionic character of the bond leads to reduced hardness. At a 
microscopic level, directional bonds more readily resist distortion and rupture 
by concentrated loads than do ionic bonds. 

The hardness of a material is usually defined as its resistance to local plastic 
deformation. Since the hardness test consists of pressing a hard indenter into 
the surface, it is equivalent to performing a highly localized compression test. 
A correlation between hardness H and yield strength a y is therefore expected. 
Typically H * 3 o y in bulk metals, but the correlation has not been directly 
verified in films or hard coatings. It is well known that a y for bulk materials 
decreases with temperature, a fact that facilitates hot mechanical forming 
operations. Therefore, it can be anticipated that the hot hardness of coatings 
will be lower than that at ambient temperatures for which values are usually 
quoted. It is these hot hardness values that are important in thermal coatings 
and in applications such as machining. A functional relationship between H 
and T given by 

H = H 0 e~ aT (12-4) 

has been proposed (Ref. 6). Values for the constants H 0 and a, determined 
from experimental data, are given in Table 12-1. 


12.3.2. Hardness Testing 

Hardness testing of films and coatings is a relatively simple (though decep¬ 
tively so) measurement to perform. The most frequently employed methods are 
modifications of techniques having long standing in the metallurgical commu¬ 
nity. The Vickers hardness test, also known as the diamond pyramid hardness 
(DPH) test, employs an indenter consisting of a square-based diamond pyramid 
ground to have a face angle of 136°. H vi the Vickers hardness number, is 
obtained as the ratio of the applied load L to the surface area of the resulting 
indentation. It is given by 

L 1.854 L 

H v = 2 cos 22° = kg/mm 2 , (12-5) 

where l v is the indentation diagonal. A related test is the Knoop microhardness 
test, which employs a rhombic-based diamond pyramid indenter where the 
length ratio of the major to minor indentation diagonal is 7.11; furthermore, 
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the minor diagonal is four times the penetration depth. The Knoop hardness is 
given by 

H k - 14.22 Ljl\ kg/mm 2 (12-6) 


where l k is the length of the major indentation diagonal. Care must be taken in 
distinguishing H v and H k values, which are sometimes used interchangeably, 
even though the former are some 10-15% lower than the latter. 

Testing loads applied to the indenter are selected to produce an indent whose 
lateral dimensions are large enough to be measured with an optical microscope 
or SEM, and whose depth is small compared with the film thickness. Caution 
must be exercised so that the plastically deformed zone under the indenter does 
not extend to the substrate where stress interactions may be reflected in 
erroneous hardness readings. Film-thickness-to-indentation-depth ratios rang¬ 
ing from 5 to 10 appear to be safe in most situations. For example, suppose 
that in TiN coatings, a value of 7 has been established as safe. In the Vickers 
test this means that the indent geometry requires that the film thickness must be 
at least larger than l v . For a TiN coating where H v = 2000 kg/mm 2 , a 10-g 
load generates an l v value of - 3 /xm. This would be the minimum acceptable 
film thickness for hardness testing in this case. Unusually small indents of less 
than a few microns in size should be avoided because of the difficulty in 
resolving dimensions. In TiN films, for example, the same indent measured by 
the same investigator optically and with an SEM gave values of 3238 and 2359 
kg/mm 2 , respectively (Ref. 12). Hardness comparisons should, therefore, only 
be made when testing methods and film thicknesses are maintained approxi¬ 
mately constant. 

Further complications arise in interpreting hardness values in hard materials. 
First, indentation-induced cracking phenomena make it difficult to measure 
indent dimensions precisely. Second, and perhaps more importantly, the 
quoted hardness values are load-dependent. For small loads, it is observed that 
the indentation dimension / varies with load as L = K x l m , where K x is a 
constant and 1 < m < 2. Elimination of l v (or l k ) in Eq. 12-5 (or 12-6) yields 


dH m - 2 ^ ( Ki \ 2/m 
~dL = m Kl \~L ) 


(12-7) 


where K 2 is either 1.854 or 14.22. A drop in hardness with increasing 
indenter load is thus predicted because m - 2 is negative. In polycrystalline 
SiC, WC-Co cemented carbide, and bearing steels, m equals 1.70, 1.88, and 
1.92, respectively (Ref. 13). It is this load dependence of hardness that may be 
partially responsible for claims of unusually outstanding hardness values in 
films and coatings. 
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It is appropriate to end this section with brief mention of nano, rather than 
micro , hardness indentation measurements. The Nanoindenter, which was 
introduced previously (Section 9.2.4) is capable of reproducibly making inden- 

O D 

tations as shallow as 500 A and as small as 5000 A across. Unlike the Vickers 
and Knoop tests, the Nanoindenter produces a triangular pyramidal indenta¬ 
tion. Small load ranges of 0-20 mN and 0-0.120 mN, make it possible to 
space indents — 1 /im apart when determining the hardness profile across a 
coating surface. 

12.3.3. Effect of Microstructure on Hardness 

A fundamental cornerstone of materials science is the relationship between 
microstructure and properties. Following Sundgren and Hentzell (Ref. 4) 
connections between the hardness of films and coatings and various microstruc- 
tural characteristics will be discussed. 


12.3.3.1. Grain-Size and Grain-Boundary Structure . In metal and alloy 
films the hardness correlates well with the Hall-Petch relation (Eq. 9-4), 
which may be written in the variant form 

H = Hj + K H r g ]/ \ ( 12 - 8 ) 

where H is the hardness, H i is the intrinsic hardness of a single crystal, / is 
the grain size, and K H is a material constant. In refractory compound 
coatings, however, Eq. 12-8 appears to be invalid because the hardness of 
fine-grained films is frequently close to the bulk material value. 

What does apparently affect the hardness and strength of refractory com¬ 
pounds is the perfection of the grain boundaries. Porosity and fine microcracks 
are very deleterious to such coatings and lower their strength and hardness 
significantly. Metal films are, however, somewhat more tolerant of such 
defects, whose influence can be blunted by plastic deformation effects. Increas¬ 
ing the substrate temperature is the simplest and most common way to reduce 
the grain-boundary defect structure and enhance the hardness of these com¬ 
pounds. The effect can be rather dramatic. For example, an increase in 
substrate temperature from 100 to 600 °C raised the hardness of magnetron- 
sputtered TiN from 1300 to 3500 kg/mm 2 (Ref. 14). Elevated temperatures 
eliminate void networks and evidently promote the strengthening of grain 
boundaries. The grain size increases as does hardness, contrary to the 
Hall-Petch prediction. With further grain growth at still higher temperatures, 
the hardness drops and reaches bulk levels. Metal films display a rather 
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different character with respect to the influence of substrate temperature. 
Raising the latter increases thermal stresses and grain size, but reduces the 
intrinsic stress and hardness. 


12.3.3.2. Metastable Structures. Metastable phases are frequently ob¬ 
served in refractory compound films. As in the case with metastable metal 
alloy films, high deposition rates and low substrate temperatures are conducive 
to the formation of nonequilibrium structures and a fine grain size. Manifesta¬ 
tions of the metastability are the incorporation of C and N in interstitial lattice 
sites and the generation of supersaturated solid solutions. This generally occurs 
during PVD rather than CVD, which is usually carried out under conditions 
closer to thermodynamic equilibrium. The incorporated interstitials tend to 
distort the lattice and the subsequent difficulty in initiating dislocation motion 
is reflected in increased hardness. The effect can be quite large. Hardnesses of 
2500 to 3500 kg/mm 2 are generally found in reactively sputtered HfN films 
compared to 1600 in CVD-grown films. Apparently the bombardment of the 
growing HfN film by the sputtering gas forces N into tetrahedral interstitial 
positions. This creates a high compressive stress in the plane of the film and 
thus a higher hardness. 

In general, metastable phases and structures can be frozen in up to tempera¬ 
tures of 0.3 T m (T m is the melting point). As a consequence, metastable hard 
coating systems can be used at temperatures up to 550-800 °C (for 
2500-3300 °C) (Ref. 10). 


72.3.3.3. Impurities . Since hard PVD coatings are generally grown in 
medium vacuum or under even higher pressure ambients, the incorporation of 
noble gases, C, N, and O from residual gases, and impurities from chamber 
hardware and walls is not uncommon. The deposit impurities are located in 
both substitutional, interstitial as well as grain-boundary sites at total levels up 
to a few atomic percent. Even at such low concentrations, the effect on 
hardening can be pronounced. The mechanism of hardening due to impurities 
apparently involves the electrostatic attachment of the latter to charged disloca¬ 
tions in ionic materials and to dangling dislocation bonds in covalent com¬ 
pounds. Such interactions limit dislocation mobility by pinning effects. 


72.3.3.4. Film Texture . By texture we mean the preponderance of one (or 
more) crystallographic planes oriented parallel to the film surface compared 
with the case of randomly oriented planes. In the latter, isotropic behavior may 
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be expected. Films grown by PVD and CVD techniques usually display a 
preferred orientation, however, with low index planes lying parallel to the 
substrate surface, creating a texture that is strongly dependent on virtually all 
deposition and process variables. Factors of as much as 2 in hardness have 
been observed in cubic coatings (e.g., TiC, TiN, ZrC) as a function of the 
preferred orientation plane [e.g., (Ill), (100) and (110)]; similar hardness 
anisotropy on basal (0001) and prismatic (1100) planes of hexagonal materials 
(e.g., WC, SiC) has been reported. 

12.3.4. Fracture 

It is a fact of nature that materials that are extremely hard are simultaneously 
brittle and prone to fracture. The phenomenon of fracture is of cardinal 
concern in a great many materials engineering applications; once fracture of a 
component or structure occurs, other issues quickly assume secondary impor¬ 
tance. Films and coatings are no exception. For example, with the exception of 
oxidation wear, all wear mechanisms are based on some sort of crack 
development that creates new surfaces, from which particles can be detached 
by fracture processes. In a similar vein, high-temperature fracture or spalling 
of coatings leaves the underlying substrate unprotected and at the mercy of 
harsh corrosive atmospheres. 

At the outset it is important to distinguish between brittle and ductile 
fracture. The latter occurs after the material has undergone some plastic 
deformation. Metals tend to undergo ductile fracture upon overloading. Brittle 
fracture, on the other hand, occurs rapidly, without warning and in such a way 
that the broken pieces can usually be neatly fitted together. Generally, brittle 
fracture occurs more readily in materials having small tensile strengths com¬ 
pared with their compressive strengths. Thus, glasses, ceramic oxides, and 
covalent, as well as hard metal compounds, to a lesser extent, are particularly 
prone to brittle fracture. 

Two approaches to fracture of coatings will be presented next. The paramount 
assumption is that the materials involved possess an intrinsic collection of 
structural flaws distributed laterally as well as through the coating thickness. 
Voids, porosity, interconnected voids or porosity, voided or grooved grain 
boundaries, local regions of de-adhesion, etc. may be viewed, in a broader 
context, as flaws and even incipient cracks. Under either external or residual 
internal stressing, each flaw will locally concentrate stress and the surrounding 
material will tend to deform. If the stresses are sufficiently large, they can 
ultimately destroy the coating by crack propagation if tensile, and by wrinkling 
or buckling if compressive. 
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12.3.5. Statistical Strength of Brittle Materials 

A useful concept for interpreting the strength of brittle solids is that of the 
“weakest link.” Under load the worst flaw is assumed to govern the strength 
and useful life of the object. Since the flaw size geometry and distribution 
varies from sample to sample, there will be a distribution in fracture strength. 
The Swedish engineer Weibull suggested a way of handling the statistics of 
strength. He defined the survival probability P( V 0 ) as the fraction of identical 
samples each of volume V 0 that survive loading to a tensile strength a, 

P(K 0 ) = exp[-(cr/(7 / ) m ], (12-9) 

where oy and m are constants. When a = 0, then P{ V Q ) = 1 and all samples 
survive. At large stress levels all samples fail and P(K 0 ) -+ 0. When a = oy , 
then P(V 0 ) = 1 j e - 0.37 and 37% of the samples survive. The constant m, 
known as the Weibull modulus, reflects how rapidly the strength falls as oy is 
approached. The greater the m value, the greater is the probability that the 
material will fail at very nearly the same stress level. In a material like steel, 
m = 100 and the reader can show that a plot of P vs. a resembles a step 
function. Thus, for a < oy, P - 1 and for a > oy, P — 0. This expresses the 
fact that steel has a rather well-defined fracture stress, oy. For brittle ceramics, 
however, m may be 5-10, and there is a much greater variability in strength, 
as evidenced by a statistically large spread in hardness values. These facts are 
displayed in Fig. 12-8 on the Weibull probability axes. If the sample volume 


In (o/o 0 ) 



Figure 12-8. Survival probability of various materials plotted on Weibull probability 
axes. (Reprinted with permission. From Ref. 1; © 1986; Pergamon Press). 
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(V) is n times larger than K 0 , i.e., V = nV Qy the probability of incorporating 
a fatal defect is enhanced leading to a smaller value of P . Under these 
circumstances, P(V) = [.PCK 0 )]" and 


P( V) = exp 


V 

Vo 



( 12 - 10 ) 


Equation 12-10 is more strictly valid for bulk solids, but it has been applied to 
the study of macrocrack distributions in TiC coatings 2-10 pm thick (Ref. 15). 
As expected, more cracks are observed in thicker deposits, a reason for 
restricting coating thickness in practice. To analyze such situations, we can 
substitute film thickness for volume in Eq. 12-10. 


12.3.6. Griffith Crack Theory and Fracture Toughness 

We now turn to the issue of the stability of a single crack under load. Fracture 
mechanics provides a basis for addressing this situation. The Griffith theory 
considers a thin flat crack of length 2/ and depth d in a plate (coating) that is 
assumed to be elastically isotropic. Tensile stresses act to open the crack, as 
shown in Fig. 12-9a. As the applied stress increases, more and more elastic 
strain energy E £ is released from the coating. At any stress level this energy 
has a magnitude of (1/2 )<te (e is the strain) per unit volume and represents 



o 



Figure 12-9. (a) Flat elliptical crack of length 2/ in a coating loaded uniaxially in 

tension, (b) Crack leading to fracture of coating, (c) Crack leading to delamination of 
coating. 
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the area under the elastic stress-strain curve. By Hooke’s law, a ~ Eb and 

E £ = o 2 /2E. (12-11) 

The elastic forces are resisted by interatomic bonds that must be broken for 
further crack extension. At any instant the total energy E r associated with 
these opposing tendencies is given by 

<t 2 

E T = - ~—irl 2 d + Ayld. (12-12) 

2 E 

The elastic strain energy is assumed to be roughly concentrated in a disk of 
volume irl 2 d surrounding the crack. The second term represents the effective 
energy associated with broken bonds, where 7 is the surface energy per unit 
area of crack. Rapid extension of the crack occurs when the rate of strain 
energy release exceeds the rate at which energy is absorbed by bonds at the 

crack tip. The instability occurs when dE r /dl = 0, and simple differentiation 

then yields a critical stress for crack propagation given by 

<7 f = \JAyE/ -nl. (12-13) 

This important formula connects Design , Processing and Materials', o c 
can be thought of as a design stress, / as the flaw size in the coating, an 
unavoidable consequence of processing (deposition) and yE as the intrinsic 
material property. It is common to refer to \j4yE as K c> a material property 
known as the fracture toughness. Fast fracture will then occur when 

> tf c . (12-14) 

As a simple application of Eq. 12-14, consider a coating of thickness d with 
a through crack to the substrate. Either fracture (Fig. 12-9b) or delamination of 
the coating (Fig. 12-9c) could occur under stress application. The condition for 
the former possibility is a c d x/2 > K c (c)/\Ztt , where K c (c) is the fracture 
toughness of the coating. Similarly, coating delamination will occur when the 
stress normal to the surface reaches a value such that o c d xjl > K c (i)/\T k . In 
this expression K c (i) is equal to y/4yE , with 7 interpreted as an interfacial 
debonding energy. 

In microhardness tests on brittle materials and coatings, cracks usually 
spread out from the indentation crater along low-index cleavage planes. 
Through measurement of the indentation crack dimensions and hardness, 
values of K c (c) and K c (i) have been estimated through complex, semiquanti- 
tative relationships (Ref. 16). K c has the peculiar units of psi-in ,/2 or MPa- 
\[m , and values for several important coating materials and substrates are 
entered in Table 12-1. Metals are typically more than an order of magnitude 
tougher than either ceramic oxides or covalent materials in resisting fracture. 
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As an example, let us estimate the critical stress required to propagate 

, o 

microdefect cracks 10 A in size within an A1 2 0 3 coating. For A1 2 0 3 , 
K c (c) « 3.5 MPa -\/m and therefore by Eq. 12-14, o c = 3.5/ 
y/7 rl0 4 X 10“ 10 = 1.97 X 10 3 MPa. This value is equivalent to 200 kg/mm 2 
or 2 x 10 10 dynes/cm 2 , which is an order of magnitude larger than expected 
internal stresses. Therefore, it can be concluded that flaws of this size are 
subcritical. The benevolence of nature in generally providing for internal 
compressive (growth) stresses in brittle coatings should be appreciated. How¬ 
ever, through-thickness cracks in thicker coatings are much more susceptible to 
propagation leading to eventual coating fracture, particularly in the presence of 
tensile stresses. These are commonly generated in situations where coatings are 
stretched during contact with other material surfaces. We now turn our 
attention to such effects. 


12.4. Tribology of Films and Coatings 


Tribology is the study of phenomena associated with interacting surfaces in 
relative motion. Adhesion, hardness, friction, wear, erosion, and lubrication 
are among some of the scientific and technological concerns of this important 
subject. From an engineering standpoint three separate categories of behavior 
can be distinguished based on the relative magnitudes of friction and wear 
(Ref. 17): 

1. Friction and wear are both low. This is the case in bearings, gears, cams, 
and slideways. 

2. Friction is high but wear is low. This combination of properties is desired 
in devices that use friction to transmit power, e.g., clutches, belt drives, 
tires. 

3. Friction is low and wear of one contacting body is high. This is the 
situation that prevails during material removal processes such as machining, 
cutting, and grinding. In these operations plastic deformation is involved, 
whereas in the first two categories the contacting bodies are generally 
elastically stressed. 

We now explore the concepts of friction and wear in more detail. 

12,4.1. Friction 

All engineering surfaces are rough and characterized by a density of projec¬ 
tions or asperities with some distribution of heights. Surfaces may be thought 
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Figure 12-10. Types of contact between rough surfaces (1 and 3) through film layer 
(2). (Reprinted with permission from Elsevier Sequoia, S.A., from J. Hailing, Thin 
Solid Films 108, 103, 1983). (a) Film thickness is less than roughness, (b) Film 
thickness is of the same order as roughness, (c) Film thickness is much greater than 
roughness. 


of as producing contact with each other at the summits of the asperities. The 
sum of these local contact areas represents the real area of contact, which may 
only be but a fraction of the apparent geometric contact area. In the majority of 
contacts surface films are present with properties different from the underlying 
bulk material. Surface films may be gases, fluids (e.g., oil), or deposited solid 
layers such as graphite, metals, or ceramic coatings. In Fig. 12-10 we consider 
three cases of contact between rough surfaces with an intervening film layer 
(Ref. 17). In the first, the film thickness is less than the roughness so that some 
asperities pierce the film enabling contact between bulk bodies (i.e., 1/3 
contact). In Fig. 12-10b the film (2) is still thin but of the order of roughness 
dimensions. All contacts are now between the upper body and the film (i.e., 
1/2 contact). In Fig. 12-10c the film is thicker, and all contacts are also of 1 /2 
type. Surface roughness is not expected to play a marked role here. 

If the subscripts a and b are associated with 1/3 and 1/2 contacts, 
respectively, then standard notions of friction yield 



(12-15) 
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In these definitions / is a nondimensional term representing the ratio of the 
interfacial shear stress r to the applied normal pressure P. The overall 
coefficient of friction / is given by the ratio of the total real shear-to-normal 
forces or 


r a^ a + r _bA_b 
P a A a + P b A b ’ 


(12-16) 


where note is taken of the total real area A for each type of contact. 
Furthermore, if the respective forces borne by each contact are F a = P a A a 
and F b = P b A b , then 


/ = /* 


Iff i \ 

F + 1 / 


(12-17) 


where / = f a /f b and F = F a /F b . No assumption has been made as to the 
nature of the film and therefore Eq. 12-17 is generally applicable to all types of 
contacts. For example, in the absence of a film, F b = 0 and / = f a as in the 
simple theory of friction. In the limit of large F, the value of / also tends to 
f a . At the other extreme where F a -► 0, f~*f b and frictional effects are 
governed by the film characteristics. 

The dependence of frictional effects on the film thickness is of considerable 
technological importance. Consider a soft solid film such as lead on steel. As 
the film thickness increases, / decreases from f a (steel-steel) to f b 
(lead-steel), reflecting the changing nature of contact. Beyond this thickness, r 
essentially stays the same but P, which is related to the effective film hardness 
changes. Thin lead films are harder than thicker ones because of the support 
provided by the steel. Therefore, P decreases as the film thickness increases, 
leading to an increase in f b or /. This causes a friction variation in film 
thickness that passes through a minimum. The same effect occurs with thin 
fluid films used to lubricate metal surfaces in contact. The well-known Stribeck 
curve then displays a minimum in the coefficient of friction as a function of 
thickness or equivalent quantity (viscosity * speed/pressure). As the film thick¬ 
ness increases, boundary, quasi-hydrodynamic and hydrodynamic lubrication 
regimes are successively operative. 

An important set of applications involves the dry lubrication of moving parts 
where fluid lubrication is not possible, e.g., at high and low temperatures or 
pressures. Thin, low-friction solid films composed of chalcogenides, oxides, 
fluorides, or carbon are used instead. In particular, MoS 2 is a favored 
lubricant and has been deposited by reactively sputtering MoS 2 targets in an 
H 2 S ambient. 
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12.4.2. Wear Mechanisms 

Wear may be defined as the progressive removal of material from surfaces that 
are under load in relative motion. Several different mechanisms have been 
identified to characterize contact wear, and these are briefly described below 
(Ref 18). 

12.4.2.1. Adhesive Wear. Adhesive wear occurs when applied tangential 
forces cause fracture between surfaces bonded at asperities. One option is for 
the fracture path to follow the original microwelded interface. Other paths lie 
above or below the interface when the strength of the bonds between asperities 
exceeds the cohesive strength of the bodies in contact. The result is material 
transfer, usually from the softer to harder body. During subsequent surface 
motion cycles, these particles may eventually be removed by fatigue fracture. 
In more severe cases of adhesive wear, smearing, galling, and seizure of 
surfaces may occur. 

12.4.2.2. Abrasive Wear , Abrasive wear is a form of cutting wear where 
the material is removed by hard wear particles, by hard asperities, or by hard 
particles entering the interface from the environment. 

12.4.2.3. Fatigue Wear. Fatigue wear occurs in situations where there is 
repeated loading and unloading of surfaces in contact. Failure may initiate at 
both surface flaws or cracks or at subsurface inhomogeneities. Crack growth 
eventually results in detached wear particles. 


12.4.2.4. Fretting Wear. Fretting wear may be viewed as a type of fatigue 
wear that occurs under conditions of oscillatory movement of small amplitude 
(in the range of 1-200 jun), but relatively high frequency. Many sequential 
damage processes occur during fretting, including breakup of protective films, 
adhesion and transfer of material, oxidation of metal wear particles, and 
nucleation of surface cracks. 

12.4.2.5. Delamination Wear . Delamination wear takes the form of regu¬ 
lar detachments of thin platelike particles from wearing surfaces due to the 
influence of high tangential (friction) forces in the surface contact zone. During 
cyclic loading the cracks that develop propagate parallel to the surface at a 
depth governed by the material properties and coefficient of friction. 
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12.4.2.6, Oxidation Wear . Oxidation wear arises from the continuous rub¬ 
bing and removal of surface films produced by reaction with the environment. 
Wear damage is modest in this case because oxide regrows soon after it is lost. 
However, at high temperatures where chemical reactions are accelerated 
oxidation wear is aggravated. 

Quantifying the various wear mechanisms is a challenge that continues to 
engage the efforts of workers in the field. However, expressions for adhesive 
wear have been developed based on the probability that a junction between the 
two sliding surfaces produces a wear particle. A commonly employed phe¬ 
nomenological formula is 

V = K w Fl/H, (12-18) 

where V is the volume of material removed, F is the normal force, / is the 
length of travel, H is the indentation hardness of the softer body, and K w is a 
dimensionless wear coefficient. A similar formula holds for wear of films 
attached to substrates. The value of K w depends on a number of factors. Most 
important is the nature of the materials in contact, but the extent and type of 
lubrication, surface temperature, and nature of the chemical environment also 
play a significant role. In order of increasing wear are the material pairs: 
nonmetal-metal, nonmetal-nonmetal, dissimilar metals, and similar metals. 

12.4.3. Tool Wear 

In the important application of hard coatings for tools, wear under cutting 
conditions appears to be governed by two processes: abrasive wear and 
chemical dissolution. The latter process is important at tool temperatures above 
1000 °C, which are typical for the high-speed machining of steel with coated 
WC tools. Dissolution of the coating atoms in the hot chips softens the cutting 
edge and leads to tool degradation. For the case of abrasive wear, a simple 
semiquantitative model to predict the relative performance of tool coatings in 
machining metals has been proposed and is worth presenting here. The results 
should be applicable to machining at low speeds or at low temperatures. The 
basic assumption is that three-body abrasion describes abrasive wear. In this 
model the abrasive particles are free to roll between two solid bodies translat¬ 
ing relative to each other. One can imagine that abrasive inclusions at the 
(two-body) chip-tool interface approximate this condition. 

In three-body abrasion Rabinowicz (Ref. 19) has found that the wear 
behavior can be divided into three regions. These depend on the ratio of the 
hardness of the abrasive inclusions (e.g., oxides, compounds, precipitates, 
hard phases, etc.) in the workpiece to that of the surface being abraded (i.e.. 
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the tool coating). Defining the former by H a and the latter by H n the wear 
volume ( V t ) removed from the tool is essentially expressed in each hardness 
regime by 


V, 

K 

K 


Kwfl 

3 H, 1 

K w Fl ( 
5.3 H, \ 

K w Fl 

2.43//, 


H, 
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(12.19a) 

(12.19b) 


(12.19c) 


When the coating is soft relative to the workpiece inclusions, then Eqs. 12-18 
and 12- 19a are basically the same and tool wear is rapid. With harder, more 
wear-resistant coatings, V t decreases dramatically as an inverse power of 
H,IH a . 

As an example, let us try to predict the relative abrasive wear suffered by 
TiN and high-speed (HSS) tools when machining steel containing hard abrasive 
particles of Fe 3 C. Under cutting conditions it is assumed that the tool-work 
interface reaches 200 °C. To apply Eqs. 12-19, we need hot hardness values, 
which can be calculated from Eq. 12-4 and the data in Table 12-1. The 
estimated hardnesses are 


TiN = 1310 kg/mm 2 
HSS = 900 kg/mm 2 
Fe^C = 865 kg/mm 2 

For TiN, H TiN /H Fe3C - 1.51 (region c), while for HSS, #hss/^f Ci c = 1-04 
(region b). Therefore, substitution in Eq. 12-19 yields 

Khss = 2.43 1310 (l ,04) 2 5 
K TlN 5.3 900 (l .51) 6 

Therefore, TiN-coated tools are predicted to outlast HSS tools by a factor of 
over 7 in this case. 

There have been numerous comparisons between the performance of coated 
vs. uncoated tools. In one study (Ref. 20) the ratio of the number of acceptable 
holes drilled by TiN-coated twist drills vs. uncoated HSS drills, prior to 
regrinding, was determined. The ratios ranged from 3 when drilling plain 
carbon steel to over 5 for both spring steel and gray cast iron. Reactive ion 
plating was used to deposit the TiN coatings in order to minimize drill 
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Figure 12-11. Flank wear of uncoated and coated carbide lathe tools as a function of 
time in machining cast iron. (Courtesy of A. Gates, Multi-Arc Scientific Coatings.) 


distortion. The results of similar benefits derived from single-layer TiC 
coatings relative to uncoated tools are shown in Fig. 12-11. Comparisons are 
made for lathe tools turning cast iron. Significantly, bilayer TiC-TiN and 
trilayer TiC-Al 2 0 3 -TiN coatings are considerably longer-lived for some of 
the reasons noted earlier. Enhanced thermal shock and mechanical impact 
resistance enable these coated tools to maintain high cutting edge strength and 
more effectively cope with tool face, edge, and flank wear. 


12.4.4. Erosion (Ref. 21) 

Erosion is a type of abrasive wear in which material is lost from a surface that 
is in relative motion with a fluid (gas or liquid) containing solid particles. 
Unlike three-body wear, which was just considered, erosion is a two-body 
wear phenomenon. It occurs in earth-moving and agricultural equipment as 
well as in machinery handling slurries. We will, however, only briefly consider 
erosion and erosion-resistant coatings as they relate to gas-turbine engines 
(Ref. 22). Ingestion of particulate matter in the front end of the engine results 
in erosion problems downstream. Large particles at the front end typically 
impinge normally and wear down the leading edge of the airfoil. Further 
downstream in the compressor, the particles tend to be smaller and are 
centrifuged toward the rotor periphery. The lower impingement angle affects 
the midchord and even trailing edge of the airfoil. Factors such as particle 
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IMPACT ANGLE 

Figure 12-12. Erosion behavior of ductile and brittle materials. (From Ref. 22). 

velocity, size, shape, hardness, and concentration all directly affect the magni¬ 
tude of erosion. However, it is the angle of impingement of the erodent that 
appears to have the greatest influence on erosion. Ductile materials exhibit 
maximum erosion at impingement angles of 15° to 30° and erode less at higher 
angles. Apparently, low-angle impingement results in a tensile component that 
stretches the material to fracture, whereas compression and hardening of the 
surface by peening are the result of normal impact. Conversely, brittle 
materials exhibit a continuously increasing erosion rate with impingement 
angle and erode the maximum rate at 90°. These opposing trends are illus¬ 
trated in Fig. 12-12. 

Coatings have been used to counter erosion problems, but the selection of 
materials that can provide protection over the entire angular range has proven 
elusive. For example, compressor blade materials such as Ti-6V-4A1 or 
Inconel 718 are ductile and can withstand high-angle impingement. On the 
other hand, carbide or boride coatings, which may be perhaps, 10 times more 
resistant to erosion at low angles, provide little or no protection at high angles. 
Composite coatings have been proposed as a solution. 


12.4.5. Ball-Bearing Coatings 

One of the most critical applications of hard coatings has been in ball bearings. 
Used in virtually all precision rotating machinery, the four basic-bearing 
components are the inner race, the outer race, the balls, and the cage as shown 
in Fig. 12-13a. The surface quality of the metallic components in rolling and 
sliding contact is of great importance. In particular, the roughness of the 
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INNER RACE 
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BALLCAGE 


b. 


d. 





Figure 12-13. (a) Ball-bearing components: inner race or ring; outer race or ring; 

ball; cage or ball retainer, (b) Schematic representation of unlubricated steel ball-to-steel 
race contact, (c) Schematic representation of lubricated ball-to-race contact, (d) 
Schematic representation of unlubricated TiC coated ball-to-race contact. (Reprinted 
with permission from Elsevier Sequoia, S.A.. H. J. Boving and H. E. Hintermann, 
Thin Solid Films 153, 253, 1987). 

raceways and balls strongly influences the effective life of the bearing. When 
the roughness is very low, then the oil film carries the load as desired, and 
minimal metal-to-metal contact occurs. However, when the roughness is high, 
the surface asperities occasionally impinge to cause local contact and micro- 
welding. This is shown in Figs. 12-13b and c, representations of the steel 
ball-steel race contact in the absence and presence of oil-grease lubrication. 
Even though the microwelds rupture almost instantaneously, the bearing 
interface roughens. The more rapid deterioration of the bearing when no 
lubricant is present is evident. Many more microwelds form, and their fracture 
releases hard abrasive metal particles. Such a situation arises in bearing 
applications where no lubricant is permitted because of environmental restric¬ 
tions. For such demands, bearings coated with several microns of hard 
compounds such as TiC and TiN exhibit dramatically improved behavior as 
illustrated in Figure 12-13d. In this case a TiC-coated ball contacts an uncoated 
race. Longer bearing life, accompanied by lower noise and vibration, occurs 
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because of several beneficial effects. When the steel impinges on TiC almost 
no microwelding or adhesion occurs between these dissimilar materials. Upon 
contact, the harder TiC tends to flatten the raceway asperities by plastic 
deformation. This process, accompanied by a smaller temperature increase 
than during microweld fracture, leads to lower wear and slower lubricant 
degradation. 

There are several issues related to bearing coatings that deserve further 
comment. First is the question of what to coat—the races, the balls, or both. 
The 1000 °C CVD TiC coating process represents a severe thermal treatment 
for high-precision bearing components. Furthermore, final hardening and 
quenching treatments are required. The geometric and size distortions accom¬ 
panying these thermal cycles are decided disadvantages, especially for large 
bearing rings and raceways. Tribology considerations require that only one 
partner of the contact couple be coated. The logical alternative to coated races 
is to use coated balls, an increasingly accepted option. Second, there is the 
challenge to high-temperature CVD by low-temperature PVD processes. Even 
so, CVD has significant advantages. Since the coating treatment lasts for 
several hours, a significant amount of diffusion between the substrate and 
coating occurs. This results in better adhesion, progressively graded mechani¬ 
cal properties across the interface and improved fatigue resistance. Finally, the 
as-deposited TiC coating surface is too rough for use in bearings; however, 
with high-precision lapping and polishing the coated balls become extremely 
smooth with a surface roughness considerably lower than attainable with 
uncoated steel. 

The benefits of coated bearings in specific unusual applications have been 
noted in the literature (Ref. 24). One example involves the orbiting European 
Meteosat telescope. The positioning mechanism of this telescope contains ball 
bearings with TiC-coated races and steel balls operating under a vacuum of 
7 X 10“ 9 torr at temperatures between -80 to 120 °C. The bearings have 
functioned perfectly for several years. In other coated-bearing applications 
involving nuclear reactors and navigation gyroscope motors, performance at 
elevated temperatures (i.e., 300 °C) and high rotational speeds (i.e., 24,000 
rpm) were respectively evaluated. In both cases TiC-coated bearings consider¬ 
ably outperformed uncoated bearings. 

From these and other testimonials on bearing behavior, it is clear that the 5 
millenia evolution of ways to support and move loads from sliding to rolling 
friction, from single contact wheels to multiple contact rollers has entered a 
modem phase of coating utilization. One might say that it is a whole new 
“ball” game. 
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12.5. Diffusional, Protective, and Thermal Coatings 


12.5.1. Diffusion Coatings 

Diffusion coatings are not coatings in the sense normally meant in this chapter. 
They are produced by a type of CVD reaction in which the element of interest 
(e.g., C, N, B, Si, Al, or Cr) is deposited on and diffused into a metal 
substrate (usually steel), in which it is soluble. The corresponding carburizing, 
nitriding, boronizing, siliciding, aluminizing, and chromizing processes yield 
surfaces that are considerably harder or more resistant to environmental attack 
than the base metal. Doping of semiconductors in which infinitesimal levels of 
solute are involved should be distinguished from diffusional coating processes. 
Through diffusion, the surface layers are frequently enriched beyond the 
matrix solubility limit, and when this happens, compounds (e.g., Fe 3 C, Fe 4 N, 
Fe 2 B) or intermediate phases (e.g., iron and nickel aluminides) precipitate, 
usually in a finely dispersed form. Sometimes, however, a continuous subsur¬ 
face compound layer forms. Since these compounds and phases are frequently 
harder than the matrix, they strengthen the surface to a depth determined by 
the diffusional penetration. The lack of a readily identifiable planar interface 
between different materials means that there is no need to be concerned about 
adhesion in such diffused layers. 

Carburization of steel is easily the most well-known and widely used 
diffusional surface treatment. Carbon-rich gases such as methane are made to 
flow over low-to-medium carbon steels (0.1 to 0.4 wt% C) maintained at 
temperatures of - 900 °C. Pyrolysis at the metal surface releases elemental 
carbon that diffuses into austenite or y-Fe, a high-temperature, face-centered 
cubic phase of Fe capable of dissolving about 1.25 wt% C at 920 °C. After 
sufficient carbon enrichment, y-Fe can be subsequently transformed to the 
hard tetragonal martensite phase simply by rapidly quenching the hot steel to 
ambient temperature. A hard, wear-resistant case or layer of martensite 
containing roughly 1 wt% C then surrounds the softer mild steel core. Many 
automotive parts, machine components and tools such as gears, shafts, and 
chisels are carburized. The hard-wearing surface is backed by the softer, but 
tougher matrix that is required to absorb impact loading. 

In order to design practical diffusional coating treatments, we must have 
phase compositions and solubilities, available from phase diagrams, together 
with diffusivity data. For example, the subsurface carbon concentration C(x , t) 
during carburization of mild steel of composition C 0 is given by 

C(x t t) — C 0 = (C s — C 0 )erfc ^ \p[)i 


(12-20) 
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where C s , the surface carbon concentration, depends on the solubility of 
carbon in the steel at the particular temperature. Other terms in Eq. 12-20 have 
been previously defined ( cf . p. 35); the value for the diffusivity of C in Fe is 
given by D = 0.02 exp[ - (20.1 kcal/mole)//?r] cm 2 /sec. For typical tem¬ 
peratures (— 920 °C) and times (~ 1 h) case depths of the order of 1000 /xm 
are produced. Even harder steel surfaces on steel can be produced by nitriding. 
Ammonia pyrolysis at 525 °C provides the N, which then penetrates the steel 
with a diffusivity given by D - 0.003 exp[-(18.2 kcal/mole)//?r] cm 2 /sec. 
After two days case layers possessing a hardness of H v 900-1200 extending 
about 300 /xm deep can be expected. Conventional nitriding should be com¬ 
pared with ion-implantation methods for introducing nitrogen into steels. This 
technology, discussed in Chapter 13, only modifies layers several thousand 
angstroms deep. 

As a final, but nevertheless important, example of a diffusion-coating 
process we consider aluminizing. Coatings based on A1 have been used for 
several decades to enhance the environmental resistance of materials to high- 
temperature oxidation, hot corrosion, particle erosion, and wear. Aluminized 
components find use in diverse applications—nuclear reactors, aircraft, and 
chemical processing and coal gasification equipment. 

Metals subjected to aluminizing treatments include Ni-base as well as 
Fe-base superalloys, heat-resistant alloys, and a variety of stainless steels. In 
common these alloys all contain substantial amounts of Ni, which is required 
for reaction with Al. Parts to be coated are packed in a retort containing A1 
salts, activators, and gases capable of reacting and transporting the Al to the 
surface being treated, in a CVD-like process. Upon solid-state diffusion, the 
intermetallic compound NiAl forms on the surface. This layer is hard and lacks 
ductility, but exhibits low wear and friction as well as impressive high-temper¬ 
ature corrosion resistance to both sulfur-containing gases and liquid sodium. 
Beyond the outer NiAl layer is a region containing a fine dispersion of Ni 3 Al 
precipitates that serve to strengthen and toughen the matrix. Typically both 
regions combined do not extend deeper than - 100 /xm from the surface. 

12.5.2. Oxidation and Oxide Films 

The universal response of metal surfaces exposed to oxygen-bearing atmo¬ 
spheres is to oxidize. The oxidation product may be a thin adherent film that 
protects the underlying metal from further attack, or a thicker porous layer that 
may flake off and offer no protection. In this section, discussion is limited to 
oxidation via high-temperature exposure; aqueous corrosion oxidation phenom¬ 
ena are already the subject of a broad and accessible literature. From the 
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M° - e-► M +1 

1/2 O 2 + 2e-► CT 2 

Figure 12-14. Mechanisms of oxidation: (a) oxide growth at oxide-ambient inter¬ 
face. (b) oxide growth at oxide-metal interface. 

standpoint of thermodynamics all of the structural metals exhibit a tendency to 
oxidize. As noted in Chapter 1, the driving force for oxidation of a given metal 
depends on the free-energy change for oxide formation. What thickness of 
oxide will form and at what rate are questions dependent on complex kinetics 
and microstructural considerations, and not on thermodynamics. As shown in 
Fig. 12-14, two simultaneous processes occur during oxidation. At the 
metal-oxide interface neutral metal atoms lose electrons and become ions that 
migrate through the oxide to the oxide-ambient interface. The released elec¬ 
trons also travel through the oxide and serve to reduce oxygen molecules to 
oxygen ions at the surface. If metal cations migrate more rapidly than oxygen 
anions (e.g., Fe, Cu, Cr, Co), oxide grows at the oxide-ambient interface. On 
the other hand, oxide forms at the metal-oxide interface when metal ions 
diffuse more slowly than oxygen ions (e.g., Ti, Zr, Si). An important 
implication is that highly insulating oxides, such as A1 2 0 3 , Si0 2 , do not grow 
readily because electron mobility, so central to the process, is low. This is 
what limits their growth and results in ultrathin protective native oxide films. 

The model of growth kinetics developed for oxidation of Si, and Eq. 8-34 in 
particular, is applicable to other systems. Both parabolic oxide growth under 
diffusion-controlled conditions, as well as linear oxide growth when interfacial 
reactions limit oxidation, are frequently observed. However, not all oxidation 
processes fit the aforementioned categories, and other growth rate laws have 
been experimentally observed in various temperature and oxygen pressure 
regimes (Ref. 25). Specific formulas for the oxide thickness c/ 0 , with constants 
C | , C 2 , * . - , C 7 , include 

Cubic rate law 


dl = C,f + C 2 (e.g., Ti-400 °C). 


(12-21) 
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Logarithmic 

d 0 = C 3 ln(C 4 / + C 5 ) (e.g., Mg-100 °C). (12-22) 

Inverse Logarithmic 

\/d Q = C 6 - C 7 ln / (e.g., Al-100 °C). (12-23) 

In fact, careful plotting of data reveals that many metals and alloys appar¬ 
ently exhibit a number of different rates, depending on temperature. Most 
metals gain weight during oxidation, but, interestingly, metals like Mo and W 
lose weight during oxidation. The reason is that the oxide films that form 
(Mo0 3 and W0 3 ) are volatile and evaporate as soon as they form. 

The physical integrity of the oxide coating is the key issue that determines its 
ability to protect the underlying metal. If the oxide that forms is dense and 
thin, then it can generally be tolerated. If it is porous and continues to grow 
and spall off, the exposed underlying metal will undergo further deterioration. 
Whether the oxide formed is dense or porous can frequently be related to the 
ratio of oxide volume produced to the metal consumed. The quotient, known as 
the Pilling-Bedworth ratio, is given by 


Volume of oxide M 0 p m 

Volume of metal xM m p 0 ’ 


(12-24) 


where M and p are the molecular weight and density, respectively, of the 
metal (m) and oxide (o), and x is the number of metal atoms per molecule of 
oxide M x O. If the ratio is less than unity, then compatability with the metal 
will create residual tensile stresses in the oxide. This will generally split it, 
much like dried wood, and make it porous, affording little protection to the 
underlying metal. If the ratio is close to or greater than unity, there is a good 
chance the oxide will not be porous; it may even be protective. On the other 
hand, if the ratio is much larger than unity, the oxide will acquire a residual 
compressive stress. Wrinkling and buckling of the oxide may cause pieces of it 
to spall off. For example, in the case of A1 2 0 3 the Pilling-Bedworth ratio is 
calculated to be 1.36, whereas for MgO the ratio is 0.82. The lack of a 
protective oxide in the case of Mg has limited its use in structural applications. 

What we have said of oxidation applies as well to the sulfidation of metals in 
S0 2 or H 2 S ambients. Metal sulfides are particularly deleterious because of 
their low melting temperatures. Liquid sulfide films tend to wet grain bound¬ 
aries and penetrate deeply, causing extension of intergranular cracks. Whether 
the elevated temperature atmosphere is oxidizing or sulfidizing, structural 
metals must be generally shielded by protective or thermal coatings. 
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12.5.3. Thermal Coatings (Refs. 26, 27) 

Ever-increasing demands for improved fuel efficiency in both civilian and 
military jet aircraft has continually raised operating temperatures of turbine 
engine components. Among those requiring protection are turbine blades, 
stators, and gas seals. The metals employed for these critical applications are 
Co-, Ni-, and Fe-base superalloys, which possess excellent bulk strength and 
ductility properties at elevated temperatures. A widely used cost-effective way 
to achieve yet higher temperature resistance to degradation in the hot gas 
environment is to employ an additional thermal barrier coating (TBC) system. 
This consists of a metallic bond coat and a top layer composed primarily of 
Zr0 2 * The bond coating, as the name implies, is the glue layer between the 
base metal and the outer protective oxide. Its function is not unlike that of a 
bond or primer coating used to prepare surfaces for painting. Typical bond 
coatings consist of MCrAlY or MCrAlYb, where M = Ni, Co, Fe. Original 
bond coating compositions such as Ni-26Cr-6Al-0.15Y (in wt%) have been 
continually modified in an effort to squeeze more performance from them. The 
role of Y or other rare earth substitutes is critical. These elements apparently 
protect the bond coat from oxidation and shift the site of failure from the base 
metal and coat interface to within the outer thermal barrier oxide. Just why is 
not known with certainty; it appears that these reactive metals easily diffuse 
along the boundaries of the plasma-sprayed particles of the bond coating, 
oxidize there, and limit further oxygen penetration. 

The use of Zr0 2 is based on a desirable combination of properties: melting 
point = 2710 °C, thermal conductivity = 1.7 W/m-K, and thermal expansion 
coefficient = 9 X 10“ 6 K“‘ (Ref. 28). However, the crystal structure under¬ 
goes transformation—from monoclinic to tetragonal to cubic—as the tempera¬ 
ture increases, and vice versa, as the temperature decreases. A rapid, diffu- 
sionless martensitic transformation of the structure occurs in the temperature 
range of 950-1400 °C accompanied by a volume contraction of 3-12%. The 
thermal stresses so generated lead to fatigue cracking, which signifies that 
Zr0 2 alone is unsuitable as a TBC. The Zr0 2 overlayers are generally 
stabilized with 2-15 wt% CaO, MgO, and Y 2 0 3 . Through alloying with these 
oxides, a partially stable cubic structure is maintained from 25 °C to 2000 °C. 
Actually the tetragonal and monoclinic phases coexist together with the cubic 
phase, whose stabilization depends on the amount of added oxide. Cubic phase 
stabilization results in stress-induced transformation toughening, which can be 
understood as follows. If a crack front meets a tetragonal particle, the latter 
will transform to the monoclinic phase a process that results in a volume 
increase. The resultant compressive stresses blunt the advance of cracks, 
toughening the matrix. 
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Both bond and thermal barrier coatings are usually deposited by means of 
plasma spraying. This process is carried out in air and utilizes a plasma torch, 
commonly fashioned in the form of a handheld gun. An arc emanates from the 
gun electrodes and is directed toward the workpiece. Powders of the coating 
material are introduced into the plasma by carrier gases that drive them into the 
arc flame. There they melt and are propelled to the workpiece surface where 
they splat and help to build up the coating thickness. Typical bond and thermal 
barrier coating thicknesses are 200 and 400 /im, respectively. Exposures to 
temperatures of 1100 to 1200 °C, to thermal cycling, and to stresses are 
common in the use of TBC systems. 



Exercises 



1. a. If the potential energy of interaction between neighboring atoms in 

hard compounds is 

V(r) = -A/r m + B/r* 

(see problem 1-5), show that the modulus of elasticity is given by 
E = m(n - m)A /a™ + 3 

(tf 0 is the equilibrium lattice spacing). 

b. Show that E is proportional to the binding energy density or 

E = -mnV(r = a 0 )/al. 

c. How well does this correlation fit the data of Fig. 12-7? 

2. Why are epitaxial hard coatings of TiC or TiN not practical or of 
particular interest? 

3- A hardness indenter makes indentations in the shape of a tetrahedron 
whose base is an equilateral triangle that lies in the film plane. 

a. If the side of this triangle has length , what is the depth of 
penetration of the indenter? 

b. What is the hardness value in terms of applied load and /,? 

4. Compare the relative abrasive wear of TiC-coated tools vs. HSS tools 
when machining steel containing Fe 3 C particles. Assume the same wear 
model and machining characteristics as in the illustrative problem on 
p. 575. 

5- Molten steel at 1500 °C can be poured into a quartz crucible resting on a 
block of ice without cracking it. Why? Calculate the stress generated. 
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6 - A coating has small cracks of size / that grow by fretting fatigue. Assume 
the crack extension rate is given by 

dt — 

- = A AK n ' with AK = \/W Aa, 

dN 


where A a the range of cyclic stress, N is the number of stress cycles, 
and A and m are constants. By integrating this equation, derive an 
expression for the number of cycles required to extend a crack from /, 
to l f . 

7. Speculate on some of the implications of the Weibull distribution for hard 
coating materials if 

a. the volume can be replaced by the coating thickness. 

b. the tensile strength is proportional to coating hardness (this is true for 
some metals). 

c. the Weibull modulus is 10. 


8 - Oxidation rates are observed to vary as 
A 



d(d 0 ) 

a. 

dt 

w 

d(d 0 ) 

D. 

dt 


d(d 0 ) 

c. 

Dt 

A, 

B, C, 


= B exp - Cd 0 
E 

= D exp — 


thickness ( d 0 ) vs. time (/) for each case. 

9- Contrast the materials and processes used to coat sintered tungsten 
carbide lathe tool inserts and high-speed steel end-mill cutters. 


1 0- By accident a very thin discontinuous rather than continuous film of TiC 
was deposited on the steel races of ball bearings. How do you expect this 
to affect bearing life? 


11. The Taylor formula Vt” = c, widely used in machining, relates the 
lifetime (/,) of a cutting tool to the cutting velocity (V). Constants n and 
c depend on the nature of the tool, work, and cutting conditions. A 
TiN-coated cutting tool failed in 40 min when turning a 10-cm-diameter 
steel shaft at 300 rpm. At 250 rpm the tool failed after 2 h. What will the 
tool life be at 400 rpm? 
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12. Assume that K w for adhesive wear is 10 10 for 52100 steel balls on 
52100 steel races and an order of magnitude lower for TiC-coated 
bearings. 

a. Approximately how many revolutions are required to generate a wear 
volume of 10 -5 cm 3 in an all-steel, 2-cm-diameter bearing if H = 900 
kg/mm 2 and F = 200 kg? 

b. At 1000 rpm how long will it take to produce this amount of wear in 
the coated bearing? 

13- A problem arising during the CVD deposition of TiC on cemented 
carbides is the loss of C from the substrate due to reaction with TiCl 4 . 
This leads to a brittle decarburized layer (the tj phase) between substrate 
and coating. Assuming that interstitial diffusion of C in W is responsible 
for the effect, sketch the expected C profile in the substrate after a 2-h 
exposure to 1000 °C, where the diffusivity is 10 -9 cm 2 /sec. 

14- The surface of a HSS drill is exposed to a flux of depositing Ti atoms and 
a N 2 plasma during reactive ion plating at 450 °C. It is assumed that an 
effective surface concentration of 50 at% N is maintained that can diffuse 
into the substrate. Under typical deposition conditions roughly estimate 
the ratio of layer thicknesses of TiN to Fe 4 N formed as a function of 
time. 
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Modification of Surfaces 
and Films 


13.1. Introduction 


Two main approaches to improving or altering the surface properties of solids 
have evolved over the years. The more traditional one involves the deposition 
of films and coatings from solid, liquid, and vapor sources. Processing 
utilizing these methods has totally dominated our attention in the book until this 
point. But there is another approach of more recent origin based on modifying 
existing surfaces through the use of directed-energy sources. These include 
photon, electron, and ion beams, and it is their interaction with surfaces that 
will be the focus of this chapter. 

Coherent (laser) and incoherent light sources, as well as electron beams, 
modify surface layers by heating them to induce melting, high-temperature 
solid-state annealing or phase transformations, and, occasionally, vaporization. 
In the case of lasers, the relation between the required power density and 
irradiation time is depicted in Fig. 13-1 for a number of important commercial 
processing applications. However, the focus of this book is thin films and in 
the applications shown much thicker layers of material are modified. These 
materials processing techniques will, therefore, not be discussed in any detail, 
nor will there by any additional mention of electron beams. Their heating 
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effects are basically equivalent to those produced by lasers of comparable 
power. Furthermore, the great depth of the heat-affected zone is more typical 
of bulk rather than surface processing. The thin-film or layer-modification 
regime we shall be concerned with is characterized by approximate laser 
energies of — 0.1-2 J/cm 2 , interaction times of ~ 10 -9 to 10 -6 sec, and 
power densities of — 10 6 to 10 8 W/cm 2 . These conditions prevail in the 
indicated region of Fig. 13-1. Surface layers ranging from 0.1 to 10 fim in 
thickness are correspondingly modified by melting under such conditions. The 
melting-solidification cycle frequently does not restore the surface structure 
and properties to their original states. Rather, interesting irreversible changes 
may occur. For example, one consequence of laser processing can be an 
ultrahigh quench rate with the retention of extended solid solutions, metastable 
crystalline phases, and, in some cases, amorphous materials. Directed thermal 
energy sources have also been employed to effect annealing, surface alloying, 
solid-state transformations and homogenization. The controlled epitaxial re¬ 
growth of molten Si layers over Si0 2 or insulators, discussed in Chapter 7, is 
an important example of the great potential of such processing. 

Like photon and electron beams, ion beams play an indispensible role in 
surface analytical methods and have also achieved considerable commercial 
success in surface processing. In the very important ion-implantation process, 
ion beams have totally revolutionized the way semiconductors are doped. 
Depending on the specific ion projectile and matrix combination, dopants can 
be driven below the semiconductor surface to readily predictable depths 
through control of the ion energy. Unlike traditional diffusional doping where 
the highest concentration always occurs at the surface, ion-implanted distribu¬ 
tions peak beneath it. The reduction of the threshold voltage required to trigger 



INTERACTION TIME, SEC 

Figure 13-1. Laser processing regimes illustrating relationships between power den¬ 
sity, interaction times and specific energy. D-drilling; SH-shock hardening; LG-laser 
glazing; DPW-deep penetration welding; TH-transformation hardening. 
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current flow in MOS transistors, by means of ion implantation, ushered in 
battery-operated, handheld calculators and digital watches. Today ion-implan¬ 
tation doping is practiced in MOS as well as bipolar transistors, diodes, 
high-frequency devices, optoelectronic devices, etc., fabricated from silicon 
and compound semiconductors. Achievements in microelectronics encouraged 
broader use of ion implantation to harden mechanically functional surfaces, 
improve their wear and fatigue resistance, and make them more corrosion 
resistant. Critical components such as aircraft bearings and surgical implant 
prostheses have been given added value by these treatments. In addition, there 
are other novel ion-beam-induced surface-modification phenomena such as 
ion-beam mixing, or subsurface epitaxial growth, that may emerge from their 
current research status into future commercial processes. 

The purpose of this chapter is to present the underlying principles of the 
interaction of directed-energy beams with surfaces, together with a description 
of the changes which occur and why they occur. Accordingly, the subject 
matter is broadly subdivided into the following sections: 

13.2. Lasers and Their Interaction with Surfaces 

13.3. Laser Modification Effects and Applications 

13.4. Ion-Implantation Effects in Solids 

13.5. Ion-Beam Modification Phenomena and Applications 


13.2. Lasers and Their Interactions With Surfaces 


13.2.1. Laser Sources 

The intense scientific and engineering research associated with the develop¬ 
ment of lasers has resulted in much innovation and rapid growth of applica¬ 
tions. Space limitations preclude any discussion of the details of the theory of 
laser construction, operation, and applications, which are all covered ad¬ 
mirably in other textbooks (Ref. 1). Suffice it to say, that all lasers contain 
three essential components: the lasing medium, the means of excitation, and 
the optical feedback resonator. The most common lasers employed in materials 
processing contain either gaseous or solid-state lasing media (Ref. 2). Gas 
lasers include the carbon dioxide (C0 2 :N 2 :He), argon ion and xenon fluoride 
excimer types. The solid-state varieties used are primarily the chromium-doped 
ruby, the neodymium-doped yttrium-aluminum-gamet and neodymium-doped 
glass laser. These solid-state lasers are excited through pumping by incoherent 
light derived from flash lamps. Gas lasers, on the other hand, are excited by 
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Figure 13-2. Output power for three modes of laser operation. (From Ref. 2). 


means of electrical discharges. Laser excitation may be continuous or cw, 
pulsed, or Q-switched to provide the different output powers shown schemati¬ 
cally in Fig. 13-2. The distinctions in these power-time characteristics are 
important in the various materials processing applications. In the welding and 
drilling of metals, for example, advantage is taken of the power-time profile in 
the pulsed and Q-s witched lasers. Both the reflectance and the thermal 
diffusivity of metals decrease with increasing temperature. Therefore, the 
high-power leading edge of these lasers is used to preheat the metal and 
enhance the efficiency of the photon-lattice phonon energy transfer. 

In Table 13-1 the common lasers employed in surface processing together 
with their pertinent operating characteristics are listed. Among the important 
laser properties are spatial intensity distribution, the pulse width, and pulse 
repetition rate. The spatial distribution of emitted light depends on the cavity 
configuration with Gaussian (TEM^) intensity profiles common. Because a 
uniform laser flux is desirable in surface processing, methods have been 
developed to convert emission modes into the “top-hat” spatial profile. The 
dwell time or pulse length, r p , ranges from less than 10 nsec to 200 nsec for 
Q-s witched lasers, and many orders of magnitude longer for other types of 
lasers. Repetition rates for pulsed and switched lasers range from one in 
several seconds to many thousands per second. Although the low repetition 
rates of Q-switched lasers may not be practical in industrial processing 
applications because the duty cycle (i.e., time on/time off - 10~ 8 ), is low, they 
are useful for laboratory research. 

It is the magnitudes of both the absorbed radiant power and r p that 
determine the effective depth of the surface layers modified through melting or 
redistribution of atoms. Generally, the smaller values of r p result in submicron 



Table 13-1. Typical Parameters for Lasers Used in Surface Modification 


Laser 

Wavelength 

(M m ) 

Focused 
Average 
Power (W*) 

Focused 
optical spot 
(mm) 

Power 

density 
(MW cm -2 ) 

Dwell/pulse 
Length (ns) 

Repetition 
Rate (Hz) 

Spatial 

Distribution 

CW-C0 2 

10.6 

200-15000 

0.1-10 

1-10 

io 7 -io'° 

- 

Multimode, top hat, 
or Gaussian 

(^-switched 

Nd:YAG 

1.06 

0.5-3 

20-40 

80-200 

130 

11000 

Gaussian 

Frequency-doubled 

Q-s witched Nd:YAG 

0.53 

0.05-0.3 

10-30 

30-100 

120 

5000 

Gaussian 

Q-switched ruby 

0.69 

1-10* 

6 

10-200 

2-80 

0.03 

Top hat 

Pulsed ruby, Nd:YAG 
or Nd:glass 

0.69, 1.06 

2-50* 

0.5 

0.5-10 

2 x 10 6 -10 7 

0.1-10 

Multimode 


*For low repetition rate, laser output is quoted in J per pulse. 
From Ref. 2. 
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melt depths. Melting and extensive interdiffusion over tens to hundreds of 
microns occur with the longer irradiation times possible with cw lasers. No 
single laser spans the total range of accessible melt depths. Section 13.2.3 is 
devoted to the quantitative modeling of the temperature-distance-time interre¬ 
lationships in the heat-affected zone. 


13.2.2. Laser Scanning Methods 

Practical modification of large surface areas with narrowly focused laser beams 
necessarily implies some sort of scanning operation as shown in Fig. 13-3a. 
For cw lasers the surface generally rotates past the stationary beam in a 
manner reminiscent of a phonograph record past a needle. Through additional 
x-y motion, radial positioning and choice of rotational speed, a great latitude in 
transverse velocities (u) is possible. This also means a wide selection of 
interaction or dwell times, t d , given by t d = d m / v , where d m is the effective 



(a) 



Figure 13-3. Schemes for broad area modification of laser-heated surfaces, (a) 
Surface translated past stationary laser beam in X and Y directions, (b) Surface rotated 
relative to laser beam at low speed. Inserts show enlarged views of melt trails. (From 
Ref. 2). 
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melt trail diameter. Typically, t d ranges between tens of microseconds to 
hundreds of milliseconds. In this case the surface-modified region appears to 
consist of a chain of overlapping elliptical melt puddles. 

The experimental arrangement for processing using pulsed or Q-switched 
lasers is shown in Fig. 13-3b. In this case, discrete, overlapping circular-mod¬ 
ified (melted) regions are generated by a train of laser pulses. For area 
coverage larger than the individual melt spots, a mechanism for raster scanning 
must be provided. This is usually accomplished by computer-controlled x-y 
stepping of substrates. 


13.2.3. Thermal Analysis of Laser Annealing (Refs. 4, 5) 

The substrate heating caused by an incident laser pulse is due to electronic 
excitation processes accompanying the absorption of light. Typical pulse 
durations of 1 nsec or longer far exceed the relaxation time for electronic 


a. 


-i 



(b) a 1 >2v ^d T p 



Figure 13-4. Schematic laser pulse intensity and temperature profiles for a penetra¬ 
tion depth a -1 that is (a) small and (b) large compared with the thermal diffusion 
length, 2 y/K d r p . (From Ref. 3). 
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transitions (~ 10“ 12 sec). Therefore, it is permissible to assume that the 
thermal history of the irradiated sample can be modeled by continuum non¬ 
steady-state heat-conduction theory. The fundamental equation for the tempera¬ 
ture T{x, t) that has to be solved is 


pc 


9T(x,t) 

dt 


d z T(x, t) 


K 


dx 2 


- A(x, t) = 0; 


0 < x < oo, (13-1) 


where the first two terms representing conventional one-dimensional heat 
conduction should be familiar to readers. The term A(x, /), in units of 
W/cm 3 , is the spatial and time-dependent power density absorbed from the 
incident laser pulse. Other quantities which appear are p, the density, c, the 
heat capacity, k , the thermal conductivity, and x and /, the distance measured 
from the surface into the interior and time, respectively. Depending on the 
relative value of the absorption length, cm, of the laser light within the 
specimen surface, two limiting regimes of thermal response can be distin¬ 
guished, as shown in Fig. 13-4. 


13,2.3.1. Strong Thermal Diffusion (2^K d v p > a When the thermal 
diffusion length 2^~K/r^ (where K d the thermal diffusivity = k/ pc) is much 
larger than ct~\ then the heat source is essentially a surface source. This is 
the situation for metal surfaces where light penetration is extremely limited. 
Hence, we assume that A(x , t) can be written as 

A(x,t) = /„( 1 - R)8(x-0)H(t - r p ). (13-2) 

The individual factors physically express that a rectangular laser pulse of 
power density / 0 is incident for time r p , after which the pulse amplitude is 
zero. The Heaviside function H(t - r p ) mathematically describes this time 
dependence. A fraction of the incident radiant energy is reflected from the 
surface whose reflectivity is R. The remainder is concentrated only at the 
surface; hence, the use of the delta function, 6 (jy — 0). 

The boundary value problem that models laser heating in the semi-infinite 
medium is then expressed by the following conditions. Initially, 

7'(jc,O) = r 0 ; 0 < .Y < oo, (13-3a) 

where T 0 is the ambient temperature. The first set of boundary conditions 
concerns heat transfer through the surface. Thus, 

BT(0,t) _ 7 0 ( 1 -R) 


dx 


K 


x — 0, 


(13-3b) 



13.2 Lasers and Their Interaction with Surfaces 


597 


assumes a constant heat flux during heating, i.e., for r p > t > 0. For time 
t > t p corresponding to cooling, 

dT( 0, t)/dx = 0. (l3-3c) 

The second boundary condition specifies T far from the surface 

T(co,t) = T 0 . (13-3d) 

Closed-form solutions for both transient heating and cooling can be obtained 
by Laplace transform methods. During heating (t < r p ) 

7 oO ~ R ) /TTP 7 • r X 


T(x,t) = 


V ^7ierfc- 7 = T + T 0 , (13-4) 

2 V K d t 


where 


ierfc z = J erfc ydy = —j= e 


z erfc z. 


Explicitly, 

T{x, t) = T h {x, t) 


l 0 (\-R) [4Kj -x 2 
-:- \l - ex PTT7- 


During cooling the temperature drops for a \\ t > r p and 
T(x,t) = T c (x, t) 


7*( X > 0 


/ 0 (l-tf) /4 K d (t-r p ) 


exp — 


(13-5) 


+ 7- 0 . 


;i3-6a) 


4 K*{t ~ t p ) 


- x erfc - 


/*Kjt - Tp ) 


;i3-6b) 


In this development it has been tacitly assumed that K d is temperature-inde¬ 
pendent, and that the laser-beam diameter is larger than 2 yjK d t p . These 
assumptions have considerably simplified the analysis, the first by linearizing 
Eq. 13-1, and the second by justifying one-dimensional heat diffusion through 
neglect of the otherwise lateral heat flow. 

At the surface of the material (x = 0) Eq. 13-6a reduces to 


7(0, 0 = 


2/„(l -*) 


\l - + T o • 


(l3-7a) 
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and Eq. 13-6b similarly becomes 


r(o, o 


2/ 0 (l -R) 

K 




(l3-7b) 


Through differentiation of these equations with respect to time, the surface 
heating and quenching rates are calculated to be 

am.hUzvjn , I3 . 8a) 

dt K V Tt V } 


dT{t) _ i 0 {\-r) r rY~ d i ~ K d ~ ~ 

dt k V 7 rt y ir(t - r p ) ' 


(l3-8b) 


respectively. 

As an example, consider a 1.8-J/cm 2 ruby laser pulse of 10-nsec width 
incident on a Ni surface at T 0 - 0 °C, for which k = 0.92 W/cm-°C and 
K d = 0.98 cm 2 /sec. In this case / 0 = 1.8 x 10 8 W/cm 2 , and if R = 0.9, the 


PULSED SOLID STATE 



10' 8 10" 7 10" 6 ID' 5 10' 4 10- 3 ID' 2 1CT 1 

TIME, sec 


Figure 13-5. Calculated melt depths vs. irradiation time for Al, Fe and Ni based on 
one-dimensional computer heat flow model. (From Ref. 6). 
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time it will take the surface to reach the melting point (1455 °C) is calculated 
to be 4.4 nsec, using Eq. 13-7a. In another 5.6 nsec a maximum surface 
temperature of 2190 °C is attained. The thickness of Ni that has totally melted 
can be estimated with the use of Eq. 13-6a. Substituting T = 1455 °C and 
/ = 1 X 10“ 8 sec, trial-and-error solution yields a value x - 4300 A. Due to 
neglect of (1) radiation and convection heat losses, (2) latent heat absorption 
during melting and liberation upon solidification, and (3) temperature depen¬ 
dence of thermal constants—these calculated effects have been considerably 
overestimated. More precisely determined melt depths vs. irradiation time and 
input power are depicted in Fig. 13-5 for AL Fe, and Ni. Submicron melt 
depths for ^-switched laser pulses are typical. 

Lastly, it is instructive to estimate the melt quenching rate. The instanta¬ 
neous value is time-dependent (Eq. 13-8) and for the example given above, 
calculation at 20 nsec yields a rate of -3.2 X 10 10 °C/sec (for T = 896 °C). 
Similarly, the quench rate at 50 nsec is -8.37 X 10 9 °C/sec (for T = 580 
°C). Such ultrahigh quench rates from the liquid phase are sufficient to freeze 
in a variety of metastable chemical and structural states in many alloy systems. 


13.2.3.2. Adiabatic Heating (2^K d v p < a \ Ref. 7). In this regime the 
temperature of the surface is largely determined by the initial distribution of 
the energy absorbed from the laser beam. Light penetrates within the material 
and the thermal evolution during the pulse duration overshadows heat diffu¬ 
sion effects that can be neglected; adiabatic heating prevails then. Such a 
situation is applicable to the laser modification of semiconductor surfaces 
where the distribution of light intensity is given by 


/ = /„( 1 - R)e\p - ax W/cm 2 . (13-9) 

The heat generation rate is equal to - dl/dx , so 

A(x, t) = a/ 0 (l - /?)exp - ax W/cm 3 . (13-10) 

Upon substitution in Eq. 13-1 and neglecting t<(d 2 T/dx 2 ), we obtain a 
temperature rise 


*T(x,t p ) 


olt p I 0 { 1 - R)e \p - ax 

pc 


(13-11) 


at / = T p after integration. The threshold incident power required to just 
initiate surface melting is therefore 


^(threshold) = cp(T M - T 0 )/{\ - R)r p a, 


(13-12) 
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where T m is the melting point. Cooling rates can be estimated if it is assumed 
that heat transfer occurs by conduction over a distance a~\ Associated with 
a' 1 is the heat diffusion (quenching) time, i q = 1/(2 a 2 K d ). Therefore, a 
surface quench rate of 

A T/t q = -2a 3 / 0 (l - R)T p K d / P c (13-13) 

is predicted which, interestingly, depends on a 3 . 

As an application of these equations let us consider Si for which p = 2.33 
g/cm\ c = 0.7 J/g-°C, K d = 0.14 cm 2 /sec, R - 0.35 (at 0.69 pm), and 
a = 2.5 x 10 3 cm -1 . For the 10-nsec, 1.8-J/cm 2 ruby laser pulse previously 



Figure 13-6. Calculated temperature vs. time at different depths below Si surface 
irradiated with a 10-nsec ruby laser pulse of 1.7-J/cm 2 energy density. (From Ref. 8). 




13.2 Lasers and Their Interaction with Surfaces 


601 


considered, AT (x = 0, / = 10 nsec) is estimated to be - 1800 °C (Eq. 
13-11), and the quench rate is calculated to be - -4.8 x 10 9 °C/sec (Eq. 
13-13). These values are only approximate, and more exact computer analyses 
that account for thermal losses, latent heat effects and temperature (and 
position) dependent material constants exist. One such calculation for the 
temperature history at different depths beneath an irradiated single-crystal Si 
surface is shown in Fig. 13-6. 


13.2.4. Solidification Rate 


If directional solidification is assumed, then an estimate of the solidification 
rate, defined as the rate of movement of the melt interface, may be obtained by 
evaluating dx j dt directly from the prior heat flow analysis. Since 


dx 

~dt 



(13-14) 


the direct dependence on cooling rate and inverse dependence on temperature 
gradient should be noted. Brute-force calculation of the involved factors is 
tedious and best left to the computer. Instead, it is instructive to take an 
intuitive approach based on heat transfer considerations. During solidification, 
the latent heat of fusion, H /y liberated at the advancing solid-liquid interface, 
is conducted into the substrate. Therefore, the thermal power balance per unit 
area of interface, which limits the solidification velocity, is given by 


dx 

p H f = K c 

f dt s 




(13-15) 


Here p is the density, S and L refer to solid and liquid, respectively, and both 
derivatives must be evaluated at the melt interface. Because the molten film is 
roughly at constant temperature during solidification, K L (dT/dx)j may be 
neglected compared to x s (dT/dx) s . Note that in this formulation dx/dt is 
directly proportional to dT/dx |, s . For the case when 2^jK d t M > a 1 , the 
temperature gradient is roughly (T M - T {) )/2yjK d f M y where the denominator 
is a measure of the thermal diffusion length for melt time t M . Therefore, an 
estimate of the solidification rate is 


- = k s (T m - T [) )/2pH f ^Kj; i . (13-16) 

Again for the case of Ni where p = 8.9 g/cm 3 and H f = 151 cal/g, with 
other constants previously given, dx/dt = 5 x 10 3 cm/sec, assuming t M = 
10 nsec. In actuality, melt times are generally longer serving to reduce the 
value of dx / dt. 
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Alternatively, for the case where a 1 > 2 \jK d t M , direct differentiation of 
Eq. 13-11 yields 

dT/dx= -a AT = -a(T M - T 0 ). (13-17) 

After substitution in Eq. 13-15, we have 

dx/dt = K S a(T M - T 0 )/pH f . (13-18) 

Evaluating this equation for Si where H f ~ 264 cal/g, and other constants 
were previously given, we obtain dx/dt = 2.1 x 10 3 cm/sec. 

The estimates of dx/dt are strongly dependent on the laser pulse power and 
duration, and are probably too high by a factor of - 2 to 5. Nevertheless, 
these ultrahigh solidification rates of several meters/sec are many orders of 
magnitude larger than conventional solidification rates in bulk materials. For 
example, single crystals of Si are typically pulled at rates of only 3 x 10“ 3 
cm/sec. 


13.3. Laser Modification Effects and Applications 


13.3.1. Regrowth Phenomena in Silicon (Ref. 9) 

In this section attention is directed to the structural and compositional property 
changes produced in silicon surface layers as a result of laser processing. 
Silicon has been singled out as the vehicle for discussion because of the large 
volume of study devoted to this important material. Furthermore, many of the 
phenomena observed in Si can be readily understood in the context of 
traditional solidification and recrystallization theories that have evolved over 
the past four decades. 


13.3.1.1. Impurity-Free Si. Laser melting of single-crystal Si wafer sur¬ 
faces results in liquid phase epitaxial (LPE) regrowth. However, when 
ultrashort picosecond pulses are applied, crystalline -► liquid -+ amorphous Si 
transitions can be sequentially induced. 

The phenomenon of solid phase epitaxial (SPE) regrowth of amorphous 
silicon layers upon surface annealing is worth noting. As we shall see later, ion 
implantation methods can be used to “amorphize,” or make amorphous, 
surface layers of Si. The latter can be recrystallized by laser annealing, in what 
amounts to a second surface modification treatment. Better control over SPE 
can be exercised, however, by means of simple furnace annealing. The result 
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is a well-defined constant planar regrowth velocity with thermally activated 
kinetics given by 

dx E 

_ , (13-19) 

where a is the atomic spacing and v is the lattice frequency. The activation 
energy (E) for impurity-free SPE is 2.35 eV, a value that corresponds to 
cooperative bond breaking and rearrangement at the moving amorphous-crys¬ 
talline interface. In its wake, dangling bonds are eliminated and interfacial 
bond straining and distortion are minimized. In contrast to the several m/sec 
laser-induced solidification rates, SPE regrowth proceeds at a velocity of only 
— 1 A/sec at 500 °C—a difference spanning some 10 orders of magnitude. 
Epitaxial regrowth rates of Si-implanted amorphous Si are strongly dependent 
on substrate orientation with (100) and (111) exhibiting the highest and lowest 
magnitudes, respectively. 


13.3.1.2 . Doped Si 

Epitaxial Regrowth. Depending on the type and concentration of impurity 
atoms, and the thermal processing parameters, a rich assortment of regrowth 
effects is possible. Consider what happens to an implanted distribution of Bi 
atoms in Si after irradiation by a 100-nsec ruby laser pulse with an energy 
density of 2 J/cm 2 . As indicated in Fig. 13-7, the distribution, originally 

o 

centered 1500 A deep, is swept toward the surface during solidification. The 
reason for this has to do with two facts. The first is that when a liquid and solid 
are in equilibrium at some temperature, solute generally has greater solubility 
in the liquid phase; second, solute atoms diffuse rapidly in the liquid phase but 
are essentially immobile in the solid. When a planar liquid-solid front now 
passes by the implanted solute, successive partitioning occurs between the two 
phases in an attempt to maintain a fixed (equilibrium) solute concentration 
ratio; i.e., k 0 = C S /C L , at the interface. Also variably known as the segrega¬ 
tion, partition or distribution coefficient, k 0 values depend on the dopant in 
question; it can, for example, range from 2.5 X 10“ 5 for Au to 0.8 for B in 
Si. A computer simulation of the solidification sequence of events is shown in 
Fig. 13-8 for the case of a solute with k 0 = 0.1, where regrowth proceeds at a 
rate of 2 m/sec. The process resembles zone refining, a technique employed to 
purify rods of material by directionally sweeping impurities and concentrating 
them at one end by means of a moving molten zone. 

A recent intriguing result is the demonstration of the solid-state analog of 
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Figure 13-7. RBS depth profile of 250-keV Bi implanted in (100) Si (dashed line). 
After irradiation with 100-nsec ruby laser pulses of 2 J/cm 2 the full line profile results. 
Circles (triangles) represent random (aligned) RBS spectra. (From Ref. 10). 



0 1000 0 1000 0 1000 


THICKNESS(A) 

Figure 13-8. Computed solute profiles in the liquid and solid at the indicated times 
during solidification. (From Ref. 8). 

zone refining in an amorphous Si layer doped with implanted Au. During 
epitaxial regrowth of Si, the amorphous Si layer shrinks in extent as the 
interface moves toward the surface (Fig. 13-9). Simultaneously, Au preferen¬ 
tially partitions into the amorphous phase and is concentrated there leaving a 
virtually Au-free epitaxial Si region behind. Interestingly, Au solubilities in 
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Figure 13-9. Channeling RBS spectra showing amorphous Si thickness and Au depth 
profiles as a function of annealing time. (From Ref. 11). 

amorphous Si many orders of magnitude greater than in crystalline Si, were 
measured (Ref. 11). 

73.3.7.3. So/ufe Trapping Effects (Ref. 12). One of the consequences of 
rapid solidification is the trapping of solute within the regrown matrix, causing 
equilibrium solid solubility limits to be exceeded. The effects can be quite 
appreciable. For example, whereas the equilibrium solid solubility limit of Bi 
in Si is 8 x 10 17 /cm 3 values as high as 4 X 10 2O /cm 3 were measured after 
regrowth at velocities of 4.5 m/sec. Similarly, for As, Ga, Sb, and In, 
solubility enhancements of 4, 10, 30, and 200, respectively, have been 
reported. One explanation of the trapping process involves a competition 
between interface and impurity kinetics. The time (/,) required to regrow a 
layer equal to the interface thickness ( a ,) is /, = ajv^ where is the 
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solidification velocity dx/dt. Meanwhile, the average time the dopant spends 
in this region is t d » a] f2D iy where Z), is the diffusivity in the interface 
region. Trapping occurs if the dopant resides in the interface region longer 
than the time required to regrow it, i.e., when t d > t r On the other hand, 
when the reverse is true, dopant is rejected into the high-solubility phase, i.e., 
the liquid. The critical velocity is then u crit = IDJa^ values of a, = 10' 7 
cm and D i ^ 10“ 5 cm 2 /sec lead to u crit = 2 m/sec, which is in reasonable 
agreement with experience. 

Another explanation (Ref. 12) suggests that solute trapping is not due to 
atoms being unable to get out of the way of the rapidly advancing interface. 
Rather, atoms are incorporated into the crystalline lattice in greater concentra¬ 
tions than predicted by equilibrium thermodynamics. Evidence for this asser¬ 
tion is the much larger k 0 value of 0.1 (instead of the equilibrium value of 
7 x 10“ 4 ) required to fit the experimental Bi profile in Fig. 13-7. Also the 
lower RBS spectrum (triangles) exhibiting channeling indicates that the Bi 
dopant primarily resides on lattice, not interstitial sites. 

One manifestation of small solute additions is the very marked effect on 
epitaxial regrowth behavior. For example, less than 1 at % of B, P, and As 
increase the regrowth velocity while O, N, and C reduce it. It is well known 
that virtually all elemental additions impede the processes of recrystallization 
and grain growth in bulk metal alloys by impeding dislocation motion and 
acting as a drag on mobile grain boundaries. Evidently additional complex 
electronic effects of dopants at the amorphous-crystal (or liquid-epitaxial) 
interface are at play in semiconductors. 


13.3.1.4. Constitutional Supercooling. One of the interesting phenomena 
that can occur when the temperature increases into the growth medium is the 
existence of a region ahead of the interface that is effectively supercooled . 
The reason is due to the buildup of rejected solute into the liquid ahead of the 
solidification front, and the effect it produces is known as constitutional 
supercooling. It is so named because the effective melt temperature varies 
with solute concentration in a manner predicted by the binary phase or 
constitution diagram. If this constitutional temperature varies less rapidly 
with distance into the melt than the actual temperature, the growth interface is 
planar (no constitutional supercooling); if the reverse is true, local instabilities 
occur and the planar solidification front breaks down through growth of 
feathery projections known as dendrites, or through formation of a cellular 
substructure. The criterion for unstable interfacial motion is 


mC Q 1 
D k 0 


dx 

— > 


dT 

Hx' 


dt 


(13-20) 
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where m is the slope of the liquidus line on the phase diagram, C 0 is the 
composition of the liquid far from the interface, and the other quantities have 
been previously introduced. 

In the regime of laser processing, the inequality is frequently exceeded and 
interfacial instabilities develop. The recrystallization breakdown occurs on the 
scale of D/{dx / dt) = (10“ 4 cm 2 /sec)/(100 cm/sec) or about 100 A. 

13.3.2. Laser Surface Alloying (LSA) 

When a thin metal film A on a metal substrate B is exposed to laser radiation, 
the combination can be alloyed through melting to yield a new modified 
surface layer. This LSA process can be understood with reference to the 
schematic cross-sectional views shown in Fig. 13-10. A laser pulse causes film 
A to melt and the resulting liquid/solid front sweeps past the original A-B 
interface; interdiffusion of film and substrate atoms occurs as irradiation 
terminates. The maximum melt depth is reached in Fig. 13-10d, where the 
atomic mixing is quite vigorous. Resolidification then begins and the 
solid-liquid interfacial velocity, which is initially zero, increases very rapidly. 
Interdiffusion continues within the liquid but the resolidified metal cools so 
rapidly that atoms are immobilized and frozen in place. The final result is an 
alloy of nominal composition A^Bj.^ which is not necessarily homogeneous. 
As an example, consider the surface alloying of a thin Au film on a Ni 
substrate. Typical Q-switched laser pulses generate total melt times ( t M ) 
ranging from 50 to 500 nsec and produce atomic mixing over the diffusion 
distance ~ 2 y/Di M . The diffusivity of atoms in liquid metals is rarely 
outside the range of 10 -5 to 10 -4 cm 2 /sec so that compositional change can 

o 

be expected over a distance of 140-1400 A. The higher estimate roughly 
agrees with the Au concentration profile data of Fig. 13-11. In this example the 

o 

total melt depth is about 4500 A and considerably exceeds the initial Au 
thickness. If, on the other hand, t M is lengthened to — 50 jisec, a situation 
arising when using a cw-C0 2 laser pulse, then the molten pool is deeper and 
the diffusional length exceeds 4500 A. Complete homogenization clearly 
occurs in this case. 

Considerable LSA experimentation has been conducted on assorted binary 
alloy systems (Ref. 2). They include (1) those in which the two components are 
mutually soluble in both liquid and solid states, e.g., Cr-Fe, Au-Pd, W-V; 
(2) those where there is appreciable liquid solubility (miscibility) but limited 
solid solubility, e.g., Cu-Ag, Au-Ni, Cu-Zr; and (3) those that exhibit both 
liquid and solid phase immiscibility, e.g., Pb-Cu, Ag-Ni, Cu-Mo. Category 
1 lends itself to thermodynamically favored interdiffusional mixing. Except for 
rare or expensive alloying elements, however, laser processing in these 
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Figure 13-11. Surface alloying of Au film with a Ni substrate after irradiation with 
C0 2 and Nd:YAG lasers. (From Ref. 2). 

systems offers few advantages over conventional bulk alloying processes. 
Thermodynamic obstacles to mixing in category 3 are not easily overcome 
even with LSA methods. Only when the films are thin enough and the melt 
temperature high enough is there the chance that a single-phase liquid will 
form, which then can be quenched to retain metastable phases. Otherwise, 
predictable phase separation will occur. Intermediate category 2 offers the 
greatest potential for quenching in metastable and amorphous phases. It is this 
class of binary systems which had been previously studied by vapor-quenching 
methods (Chapter 5) over a decade earlier. The objective was the same as LSA 
—to extend solubility of terminal phases, freeze in metastable phases and 
produce amorphous phases by suppressing crystallization. Laser scanning 
methods offer the best means of achieving these ends over large surface areas. 


13.4, Ion-Implantation Effects in Solids 


13.4.1. Introduction 

Ion-surface interactions have already been discussed within several contexts in 
this book. Sputtering for film deposition and Rutherford backscattering for 






610 


Modification of Surfaces and Films 


microanalysis are the most important examples; typical ion energies involved 
are 5 keV and 2 MeV, respectively. At ion energies between these extremes, 
i.e., tens to hundreds of keV, the probability is great that projectile ions will be 
implanted hundreds to thousands of angstroms deep beneath the surface. 

As a surface modification technique, ion implantation has a number of 
important advantages as well as disadvantages (Ref. 13). Among the advan¬ 
tages are controllable and reproducible subsurface depth concentrations, no 
sacrifice of bulk properties, low-temperature processing, no significant dimen¬ 
sional change in implanted objects, extension of solid solubility limits, forma¬ 
tion of metastable phases, and vacuum cleanliness. Significant limitations 
include line-of-sight processing, shallow penetration of ions, lattice damage, 
and, of course, very high capital equipment and processing costs. Despite the 
latter drawbacks, ion implantation is not only indispensible in VLSI process¬ 
ing, but its use has been explored as a means of beneficially modifying 
virtually every surface property of interest. 

Modification of surfaces occurs because the newly implanted distributions of 
chemical species are accompanied by considerable structural disorder. Some¬ 
times it is possible to induce compositional change without appreciable struc¬ 
tural modification, e.g., in the case of low-dosage implants followed by 
thermal annealing. Alternatively, crystalline targets can be disordered struc¬ 
turally and even made amorphous by implanting ions that are identical to 
matrix atoms. In this case no chemical change is effected. Frequently, how¬ 
ever, both compositional and structural changes are inseparably linked and 
serve to broaden the number of possible ways surfaces can be modified. Some 
choice over the extent of modification can be exercised through control of 
processing variables. 

13.4.2. Energy Loss and Structural Modification 

The collisions of an individual energetic ion in a solid cause the motion of 
atoms and the excitation of electronic states. At the outset the ions primarily 
induce “gentle” electronic transitions that cause relatively little structural 
damage. Nevertheless, the electronic structure is excited by Coulomb interac¬ 
tions with moving ions. A relatively narrow lightning-like trail surrounds the 
ion track and defines a region of intense electronic excitation, e.g., ionization, 
secondary and Auger electron production, electron-hole pair formation, lumi¬ 
nescence, etc. Through these mechanisms of energy loss the ion slows suffi¬ 
ciently until it begins to set in motion violent nuclear collision cascades along 
its trajectory. These cascades are the result of displaced atoms that dislodge yet 
other atoms so that a jagged branched trail is produced. 



13.4 lon-Implantatlon Effects in Solids 


611 


Through these interactions the ion energy ( E ) continually decreases with 
distance (z) traversed in a very complicated way. For simplicity the energy 
loss is expressed by 

dE 

~Tz ^ N [ S e(E)+S H (E)\, (13-21) 


where S e (E ) and S n (E) are the respective electronic and nuclear stopping 
powers (in units of eV-cm 2 ), and N is the target atom density (in units of 
atoms/cm 3 ). The magnitudes of both stopping powers depend on the atomic 
numbers and masses of the ions as well as target atoms. Typically, electronic 
stopping results in energy losses of 5-10 eV/A as opposed to the higher losses 
of 10-100 eV/A for nuclear stopping. The Lindhard-Scharff-Schiott (LSS) 
theoiy (Ref. 14) has quantified the nature of these ion-sol id interactions and 
predicted the extent of partitioning of the energy loss. It is well established that 
S e (E) varies as E l/2 . The energy dependence of S n (E) varies widely with 
ion-matrix system; S n (E) is large at low E but relatively small and constant at 
high E. An explicit expression for S n (E) is 


. \ dE 7T 

SJE) = -= — aZ,Z 2 q' 

nV ' N dz 2 12 


M, 


Af, + M 2 ’ 


(13-22) 


where the atomic numbers and masses of the projectile ion and target nucleus 
are Z,, Z 2 , and M x , M 2 , respectively, and q is the electronic charge. Values 

o 

of a, the screening radius for the collision, lie between 0.1 and 0.2 A. In 
Chapter 3 it was noted that the sputter yield, which is related to S n (E ), rises at 
first, reaches a maximum, and then falls with increasing energy. At low 
energies S n > S eJ whereas the reverse is true at higher energy. The critical 
energy E c for which S e (E c ) = S n (£’ c ), depends on the nature of the ion-target 
combination. For example, in amorphous GaAs, E c = 15 keV for Be, and 800 
keV for Se. 

From the standpoint of structural modification, matrix damage and atomic 
relocation following nuclear collisions are cardinal issues (Ref. 15). In addition 
to the cascades referred to earlier, which can affect a considerable portion of 
the matrix, depending on the extent of the branch overlap, there are also 
‘‘spikes.” When the density of energy deposition in either electronic or 
nuclear cascades exceeds a certain level, the result is a thermal spike. These 
are launched when bombarding particles transfer energies of a few hundred eV 
to lattice atoms with the virtually instantaneous liberation of heat. In Cu, 
calculation has shown that the spike will heat a sphere of approximately 20 A 
in diameter to the melting point within 5 X 10' “ 12 sec. In another 3 X 10 11 
sec, the temperature decays to 500 °C, quenching the motion of some 1000 
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atoms in the process (Ref. 16). Small, highly energized cascade regions melt 
and quench too rapidly to grow epitaxially within the surrounding crystalline 
material. As a result, amorphous zones form. In silicon these have been found 
(Ref. 17) to be more resistant to formation of defect-free material on low-tem¬ 
perature annealing than thin amorphous-Si surface layers or films formed by 
laser melting or by physical vapor deposition. In this sense the amorphous 
cascade or spike regions have a different defective nature. 

The dose of ions required to amorphize a crystalline matrix can be roughly 
estimated by assuming that the energy density is essentially the same as that 
needed for melting. In the case of Si this amounts to about 20 eV/atom, or 


LOW DOSE << 10 13 « HIGH DOSE 

ions/cm 2 


INDIVIDUAL 

CASCADES 


CASCADE 

OVERLAP 



Figure 13-12. Schematic of individual cascades at low doses and overlapping cas¬ 
cades at high doses. R is typically 100 A, the collision cascade time is ~ 10' t4 sec 
and the thermal quench time is 10~ 12 sec. (Reprinted with permission from Plenum 
Publishing Corp., from Ref. 15). 
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— 10 24 eV/cm 3 . For ions of energy E 0 the dose 0 is 

0 = 10 24 R p /E 0 ions/cm 2 (13-23) 

where R p (soon to be discussed in the next section) is the ion range. In the 
case of 100-keV B ions, for which R p = 3200 A, 0 = 3.2 X 10 14 ions/cm 2 . 
In practice a dose greater than 10 16 ions/cm 2 is required, indicating that 
damage and recrystallization effects occur simultaneously during implantation. 

A montage of assorted electronic and nuclear cascades is depicted in Fig. 
13-12. Typical picosecond time scales rule out real-time monitoring of these 
events. Dimensions of several nanometers mean that high-resolution TEM 
techniques are required to image damage details. 


13.4.3. Compositional Modification 


From the previous discussion it is clear that no two ions will execute identical 
trajectories, but will participate in some admixture of nuclear and electronic 
collision events. Furthermore, the collective damage and zigzag motion of ions 
within the matrix cause them to deviate laterally from the surface entry point. 
When summed over the huge number of participating ions, these factors lead to 
the statistical distribution of ions as a function of position shown in Fig. 13-13. 
The concentration of implanted ions ideally has a Gaussian depth dependence 
given by 


C{z) 


<t> I z-R p \ 2 

v^A R/ XP \V2AR p ) 


(13-24) 



Figure 13-13. Distribution of implanted ions as a function of depth. 
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with a peak magnitude varying directly as the fluence or dose 0 of incident 
ions. Dose has units of number (of ions) per cm 2 and is related to the measured 
time integrated current or charge Q deposited per unit surface area A. 
Specifically, 

0 = Q/nqA , (13-25) 

where n is the number of electronic charges, q, per ion. The projected range 
R p is the depth most ions are likely to come to rest at, yielding a peak 
concentration of 0/ ^2tt A R pt Note that the actual distance an ion travels is 
greater than the component projected normal to the target surface. This is 
analogous to the total distance executed in atomic random walk jumps exceed¬ 
ing the net diffusional displacement. Experimental spread in the ion range is 
accounted for by the term A/? ;J , the standard deviation or longitudinal 
“straggle” of the distribution. Similarly A R ly the lateral ion straggle, is a 
measure of the spread in the transverse direction. 

Values of R p and A R p have been calculated to accuracies of a few percent 
for many ion-target atom combinations as a function of energy (Ref. 18). 
Results for N and Cr in Fe are shown in Fig. 13-14. For the same energy the 
lighter N ions penetrate further as intuitively expected. Typically R p is about 
0.1 /im. Similar information for the common dopants B, P and As in Si, Si0 2 
and photoresist matrices, is commonly used in designing VLSI implantation 
doping processes, and can be found in standard texts on the subject (Refs. 19, 
20). Our primary concern, however, is not with semiconductors and it is, 



ENERGY (KeV) 

Figure 13-14. Calculated values of R p and A R p for N and Cr in Fe as a function of 
energy. (From Ref. 18). 
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therefore, instructive to mention a number of factors that distinguish these two 
kinds of ion-implantation applications. 

1. The chief one from which many differences stem is the dose employed. In 
semiconductor technology typical doses of 10 12 to 10 I5 /cm 2 , or about a 
monolayer at most, lead to dopant volume concentrations of less than 0.1 
at%. In nonsemiconductor applications fluences of 10 15 - 10 l7 /cm 2 are 
common and result in atomic concentrations of several percent. 

2. A consequence of high implanted concentrations is the deviation from 
predicted Gaussian profiles which strictly prevail in the low-dose regime. 
Concentration-dependent chemical affinities, radiation-enhanced diffusion, 
and higher target temperatures because of beam heating, contribute to 
altering the implanted profile in the high-dose regime. 

3. Greater surface sputtering arises from high-dose implants. This is an 
obvious consequence of the higher ion flux bombardment and causes the 
target surface to recede. The effect can be appreciable as shown in Fig. 
13-15 for the case of 150-keV Ni ions implanted into 430 stainless steel at a 

CALCULATED 
ORIGINAL FINAL 



0 200 400 600 800 

DISTANCE (A) 


Figure 13-15. Comparison of calculated (LSS) and measured (ESCA) distribution of 
Ni implanted into stainless steel, (Reprinted with permission from Elsevier Sequoia, S. 
A., from S. B. Agarwal, Y. F. Yang, C. R. Clayton, H. Herman and J. K. Hirvonen, 
Thin Solid Films 63, 19, 1979). 
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dose of 5 X 10 17 /cm 2 . Together with radiation enhanced diffusion, the 
apparent preferential sputtering of Cr results in the buildup of Ni at the 
surface and the gross alteration of the Gaussian profile. 

4. In semiconductor applications it is essential that the structurally damaged 
matrix be restored to crystalline perfection and that the dopants be electri¬ 
cally activated. A high-temperature (~ 900 °C) anneal fosters both defect 
annihilation and crystal regrowth, where dopant incorporation at desired 
lattice sites occurs. No such post-implantation treatment is normally carried 
out in the nonsemiconductor applications. On the contrary, the reaction 
and retention of nonequilibrium surface phases and structures, not possible 
via conventional thermodynamic and diffusional routes are frequently 
desiderata. 


13.5. Ion-Beam Modification Phenomena and Applications 


13.5.1. Ion-Beam Mixing (Ref. 22) 

Ion-beam mixing phenomena deal with compositional and structural changes in 
a two- or multiple-component system under the influence of ion radiation. The 
effect commonly occurs during sputtering and results in changes in surface 
composition during depth profiling analysis by SIMS and AES techniques. For 
example, consider a thin film of A on substrate B bombarded by a beam of 
inert-gas ions. Typically, the ion range (R) exceeds the escape depth of the 
sputtered A atoms. If R does not exceed the thickness of A, then only A atoms 
sputter. If, after some sputtering, R now extends into the substrate region, the 
atomic displacements and interdiffusion that occur within collision cascades 
will cause A and B to intermix. The mixing occurs locally at the interface and 
eventually links with other similarly intermixed zones to create a continuous 
ion-beam mixed layer. Now B atoms also enter the stream of sputtered atoms 
because the combination of continued surface erosion and interfacial broaden¬ 
ing, due to ion mixing, has brought them closer to the surface. 

It is of interest to estimate the extent of interfacial broadening. To do this, 
we set R equal to the half-width of the broadened ion-mixed layer and assume 
that nuclear energy loss dominates with a stopping power independent of 
energy (Ref. 23). Therefore, from Eq. 13-21, 

rR ro dE E 0 , 

R ~ L dZ ~ L 0 MS„(E) “ ( dE/dz)„ ’ 13 26 
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where {dEjdz) n = - NS n {E). Direct substitution in Eq. 13-22 for the case of 
Ar in Cu shows that ( dE/dz) n 125 eV/A (for a = 10~ 9 cm). Thus, for 

o 

every keV of ion energy the altered layer extends about 8 A. 

Through the use of high-energy ion beams, mixing reactions occur over 
substantial dimensions. Films can, therefore, be effectively alloyed with sub¬ 
strates and layered, but normally immiscible films can be homogenized with 
the assistance of ion implantation. As an example, consider the multiple-layer 
structure consisting of alternating Au and Co films shown in Fig. 13-16. 
According to the phase diagram, these elements do not dissolve in each other; 
but they form a uniform metastable solid solution under a flux of 3 x 10 15 Xe 
ions/cm 2 at an energy of 300 keV. Collision cascades, ballistic effects of 
recoils, and defect migration during room-temperature irradiation all con¬ 
tribute, in a complex way, to the observed mixing. 



Figure 13-16. (a) RBS spectra showing formation of a uniformly mixed Au a .Co,_^ 

solid solution layer by ion-beam mixing of Au-Co multilayers, (b) RBS spectra of 
ion-mixed Au-Co films with compositions Au 75 Co 25 , Au 50 -Co 50 , and Au 25 Co 75 . 
(Reprinted with permission from Plenum Publishing Corp., from Ref. 22). 
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Among the systems that have been researched for ion-beam mixing effects 
are (Ref. 22) 

1. Silicides: Pt, Pd, Mo, Nb, Cr 

2. Bilayer metal-metal films: Al-Ni, Al-Pd, Al-Pt, Au-Ag, Au-W, Cu-Au 

3. Multilayer metal-metal films: Au-Co, Au-Ni, Au-V 

4. Eutectics: Au-Si 

In silicides, equilibrium as well as metastable phases have been observed 
after mixing. Experiment has shown that the silicide thickness is both dose- 
and ion-species-dependent. At the same energy and dose, more mixing occurs 
the heavier the ion. In metal film systems, extended solubility is virtually 
always observed, metastable phase formation is a frequent occurrence, and 
amorphous phases occasionally form at cryogenic temperatures. 

13.5.2. Modification of Mechanically Functional Surfaces 
(Refs. 13, 24) 

By enhancing the ability of surfaces to resist plastic deformation, the benefits 
of reduced wear, less tendency to surface cracking, and greater dimensional 
stability are effected. In recent years there has been considerable research on 
the use of ion implantation to realize these desirable ends. For the case of steel, 
the implantation of light interstitial ions, such as nitrogen, boron, and carbon, 
yields considerable improvement in wear and fatigue resistance. The reason is 
due to the elastic interaction between dislocations and undersized interstitials; 
this results in their mutual attraction and the segregation of the atoms to the 
defects. Long known in metallurgical circles, the interaction occurs even at 
room temperature and is quite effective in pinning dislocations, thus restricting 
their motion. In addition, iron nitrides, carbides, borides, etc., form when the 
limited solubilities of the interstitial atoms are locally exceeded; these precipi¬ 
tates are also effective barriers to dislocation movement. Since surface damage 
processes depend on plastic flow of surface layers, the importance of limiting 
dislocation motion is apparent. 

An alternate approach to improving the wear resistance of surfaces through 
the deposition of hard coatings was addressed at length in Chapter 12. It is 
instructive to compare this approach with that of ion implantation. Although 
CVD deposits manage to conformally coat external as well as internal surfaces, 
ion implantation is limited by geometry to line-of-sight processing. CVD 
coatings are bonded to the substrate across an interfacial region, which is 
frequently a source of adhesion difficulty; on the other hand, ion-implantation 
modified layers are not subject to adhesion problems because no sharp inter- 
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face exists. Since CVD deposition is conducted at elevated temperatures, the 
substrate is frequently heat-affected and sometimes softened in the process; ion 
implantation only modifies a very thin surface layer, leaving the remainder of 
the substrate unaffected. Lastly, thick CVD coatings several microns thick 
imply less stringent substrate smoothness requirements than for ion-implanta¬ 
tion processing. The latter is the only practical method available for modifying 
precision surfaces while preserving extreme dimensional tolerances. Clearly, 
ion implantation is only cost-effective for high-value added components, such 
as surgical implants or dies. 

The issue of the effective surface depth modified by ion implantation is an 
interesting one. It has been observed that ion implantation effects persist well 
beyond the shallow depth of the projected ion range. Implanted atoms are 
frequently observed considerably deeper within the substrate than can be 
accounted for by the geometry of wear tracks. The cause has been attributed to 
the generation of fresh dislocation networks that effectively trap and drag 
atoms deeper below the damaged surface layers. Frictional wear is also 
accompanied by temperature increases of as much as 600-700 °C at contacting 
asperities. Migration of mobile impurities is thus encouraged, especially where 
the dislocation density is high. Such effects provide an unexpected wear 
protection bonus for ion-implanted surfaces. 

A number of industrial applications involving wear reduction by means of 
ion implantation methods is listed in Table 13-2. Hardness and resistance to 
adhesive and abrasive wear are the attributes required of the assorted cutting, 
mechanical forming, and molding tools. In steel matrices nitrogen is a favored 
interstitial ion, and implanted cobalt has been explored as a means of modify¬ 
ing tungsten carbide tools. Typical doses are well into the 10 17 /cm 2 range and 
impart two- to fivefold decreases in wear rate with corresponding increases in 
tool life. Specific examples of ion-implantation modified tools and components 
are shown in Fig. 13-17. 

A totally different application which nevertheless exploits the benefits of 
enhanced hardness and reduced wear involves metallic surgical implants. Tens 
of thousands of titanium alloy (Ti-6 wt%-Al-4 wt% V) hip and knee replace¬ 
ment prostheses have already been ion-implanted with nitrogen resulting in 
improved tribological properties. In service, the implant moves in contact with 
a high-molecular-weight polyethylene mating socket, so that wear of this 
couple is of concern. Apparently, the formation of hard Ti0 2 particles results 
in abrasion of the unimplanted alloy surface during use. Implantation produces 
a surface containing hard titanium nitride precipitates that effectively resists 
such wear. Part of the improvement in properties may be attributable to the 
enhanced corrosion resistance ion-implanted surfaces exhibit. High defect 
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Table 13-2. Industrial Applications of Ion Implantation for Wear Reduction 



Application 

Material 

Treatment 

Dose(10 l7 /cm 2 ) 

Result 

1 . 

Paper slitters 

1.6 Cr-1C Steel 

8-N 

Cutting life 
increased 2 X 

2. 

Acetate punches 

Cr-plate 

4-N 

Improved life 

3. 

Taps for threading 
plastic 

High-speed 

steel 

8-N 

Life increased 5 x 

4. 

Slitters for synthetic 
rubber 

WC-6% Co 

8-N 

Life increased 2 x 

5. 

Tool inserts 

4Ni-lCr Steel 

4-Co 

Contamination 

reduction 

6. 

Forming tools 

12Cr-2C Steel 

4-N 

Reduced adhesive 

wear 

7. 

Dies for Cu rod 

WC-6% Co 

5-C 

Throughput increased 3 X 

8. 

Drawing dies 

WC-6% Co 

2-Co 

Improved life 

9. 

Dies for steel wire 

WC-6% Co 

3-C 

Wear rate reduced 3 X 


From Ref. 24. 



Figure 13-17. Examples of ion-implanted components including a die for making 
Band Aids, knee and hip surgical prostheses and cryogenic pistons. (Courtesy of J. K. 
Hirvonen and B. C. Haywood, Spire Corp.) 
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concentrations promote thickening of air-formed oxide films and enhance 
chemical homogenization of the underlying metal. The former effect affords an 
added measure of surface passivation and protection, and the latter helps 
eliminate localized galvanic corrosion. Lastly, note that there are no practical 
alternatives to modifying the surface properties of orthopedic prostheses. 
Unlike tools whose surfaces can tolerate CVD or PVD coatings, chemical 
biocompatibility with contacting body fluids places severe restrictions on the 
surface composition of surgical implants. 


13.5.3. Modification of Surface Morphology (Refs. 25, 26) 

In addition to chemical and structural modification of surface layers, ion beams 
can, interestingly, alter the topography of surfaces on which they impinge. 
Ions of moderate energy, i.e., less than 1 keV on up to tens of keV, are 
necessary. Under bombardment various micron-sized surface structures such 
as cones, pyramids, ridges, ledges, pits, faceted planes as well as quasi-liquid- 
like and microtextured labyrinth-like features develop. An example of cone 
formation on a Cu single crystal after 40-keV Ar bombardment is shown in 
Fig. 13-18. The term cone formation is used to generically categorize this 
class of topological phenomena. 

Known for half a century, cone formation has been interpreted in terms of 
either left-standing or real-growth models. The former model views conical 
projections arising from sputter-resistant impurities or intentionally deposited 



Figure 13-18. Pyramid structures on a single-crystal Cu surface after 40-keV Ar ion 
bombardment. (From Ref. 27). 



622 


Modification of Surfaces and Films 


seed atoms (e.g., Mo) that etch or sputter at a lower rate than the surrounding 
surface (e.g,, Cu). Sputter-yield variations as a function of ion incidence angle, 
nonuniform redeposition of atoms on oblique cones, and surface diffusion 
effects are operative in this mechanism. The latter model (Ref. 25) likewise 
requires the presence of impurity seed atoms. In this case they serve as 
nucleating sites for growth of genuine single-crystal whiskers. These sprout in 
all crystallographic directions with respect to ion incidence. The subsequent 
interplay among whisker growth, surface diffusion, and sputtering results in 
the observed cones. 

The following experimental facts regarding seed cone and whisker formation 
during ion bombardment have been established (Ref. 25). 

1. Very small amounts of metal impurity or seed atoms are required. Different 
impurity seed atoms for Cu include Ta, Mo, Fe, Cr, and Ti. Cones have 
been observed on many different metal surfaces, e.g., Al, Au, Pb, and Ni. 

2. Contrary to long-held belief, the seed atoms need not have a lower sputter 
yield than matrix atoms. They apparently must have a higher melting 
point. 

3. At low ion energies (< 1 keV) an elevated target temperature (i.e., one 
third of the melting point) is required for seed cone formation. 

4. When the bombarding energy is close to the sputter threshold, whiskers 
may be observed. At higher ion energy, thin slanted whiskers evolve into 
cones as they collect sputtered atoms from surrounding areas. 

5. Pure metal surfaces without any trace of impurities or seed atoms do not 
develop cones upon exposure to ion beams. 

Potential applications for rough surfaces containing cones or whiskers are 
selective light absorbers, catalysts, electron emitters, adherent interlocking 
surfaces for surgical implants or for joining processes. Thus far, however, no 
commercial exploitation of such ion-beam-modified surfaces has occurred. 


13.5.4. Ion-Beam Modification of Silicon (Refs. 28, 29) 

The chapter closes with a brief description of two novel modification processes 
in Si that do not involve traditional low-dosage semiconductor doping. 
Through implantation of 0 + at fluences of - 1.5 X 10 18 ions/cm 2 , at 
energies of — 200 keV, there is sufficient oxygen present to form a subsurface 
layer of stoichiometric Si0 2 . During subsequent annealing, the damage in the 
overlying Si is repaired, and uphill diffusion of oxygen sharpens both Si0 2 -Si 
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.MICROSTRUCTURE OF Si/SiO,/Si vs, TEMPERATURE 



Figure 13-19. TEM cross-sectional micrographs of Si implanted with 1.8 x 10 18 
0 + cm -2 and subsequently annealed at 1150 °C-2 h, 1250 °C-2 h and 1405 °C-0.5 
h. (Courtesy of G. K. Celler, AT&T Bell Laboratories, Reprinted with permission from 
Cowan Publishing Company, from G. K. Celler, Solid State Technology 30 (3), 69, 
1987). 

interfaces. As shown in Fig. 13-19, heat treatments at temperatures of 1250 °C 
do not completely anneal out dislocations, twins, and Si0 2 precipitates. 
Temperatures of 1405 °C, which are close to the melting point of Si, are, 
however, particularly effective in the chemical segregation of implanted O into 
the buried Si0 2 and in recovering the crystallinity of Si. Ion-beam synthesis of 
buried Si0 2 , known as SIMOX (separation by implanted oxygen) is the third 
and perhaps most promising SOI process mentioned in this book (see page 
333). As a postscript, CMOS dual-modulus prescaler circuits were recently 
(1989) fabricated at AT&T Bell Laboratories within the upper Si layer thinned 
from 2400 to 1200 A. The reported operating speed of 6.2 GHz was 50% 
higher than in the group of control circuits, and represents the highest reported 
speed in practical digital circuits. 

The second ion-beam modification process, known as mesotaxy, concerns 
the formation of buried epitaxial disilicide layers in Si. To achieve this feat, 
200-keV Co + ions are implanted into Si at fluences of — 2 x 10 17 ions/cm 2 . 
This is followed by annealing treatments at 1000 °C during which time the Co 
distribution narrows and the surrounding lattice damage is significantly re- 

o 

duced. Abrupt epitaxial silicide layers buried 1000 A beneath the Si surface 
can be produced this way (Ref. 29). 

The formation of both high-quality buried insulating (Si0 2 ) and conducting 
(CoSi 2 ) films beneath crystalline Si represent important advances in the 
eventual realization of three-dimensional integrated circuits. 
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Exercises 



1. An atomistic model for the solid-liquid interfacial velocity during solidi¬ 
fication suggests that 

(Eq. 1-34) 

Calculate a value of v for Si, assuming that the atomic diffusivity in the 
liquid state is 1CT 4 cm 2 /sec, v = 10 13 sec -1 , a Q = 2.7 A, and that the 
free-energy difference between Si atoms in the liquid and solid is equal to 
the latent heat of fusion. How does the value of v compare with 
interfacial velocities calculated in this chapter? 

2. Comment on the probability of successfully alloying the surface of the 

o 

following 2000-A film-bulk substrate combinations by laser processing: 
Zn-Mo, Mo-W, Fe-Ni, Ta-Cd. 

3. A 50-nsec, 37.5-mW/jtm 2 Kr ion laser pulse (X = 6470 A) is incident on 
TeGe, a candidate material for phase-change optical data storage applica¬ 
tions. For TeGe, R = 0.65, p = 6.2 g/cm 3 , k = 0.015 cal/cm-sec-°C, 
c - 0.08 cal/g-°C, and 

2 > a' 1 

a. Plot the surface temperature as a function of time during heating as 
well as cooling. 

b. What is the maximum surface temperature reached? 

c. What is the maximum quench rate at the surface? 

4- An amorphous Si film on a flat single-ciystal Si wafer is annealed. 
Schematically sketch and distinguish between the resultant grain struc¬ 
tures if (a) random nucleation and growth or (b) solid-phase epitaxy 
governs the solid-state crystallization process. 

5- For corrosion resistance a surface distribution of Cr in steel that is 
relatively constant as a function of depth is required. Show that succes¬ 
sive overlapping implants is a way to achieve this. Consider three such 
implants with Cr ion energies of 50, 100, and 200 keV. If the peak 
concentration in each case is maintained at 20 at% Cr (the stainless 
composition), what must the ratio of the doses be? Sketch the resultant 
profile of the overlapping distributions. 



Exercises 


625 


6 - Show that when the as-implanted Gaussian distribution given by Eq. 
13-24 is thermally annealed, promoting atomic diffusion for time /, the 
resulting distribution 


C(z, t) 


<t> [ I z-R p \ 

v/2x(A R* + 2 Dt) ^ 2\(AR 2 p + 2Dt) 1/2 ) 


satisfies the diffusion equation. What is the physical significance of the 
term 2 Dt y and its addition to the term A R 2 p . 

7. Steel dies are implanted with 200-keV N ions to enhance wear resistance. 

a. If the dose is 4 x 10 l7 , what is the peak N concentration? 

b. The Fe-N phase diagram indicates that the solid solubility of N in Fe 
is about 0.01 at% at room temperature. Excess amounts of N precipi¬ 
tate as Fe 2 N. Based on this information, what is the phase constitution 
of the matrix following ion implantation? 

c. Compare the times required to reach the solubility limit of N in Fe 
(~ 0.4 at% at 590 °C) 1000 A beneath the surface by ion implanta¬ 
tion, and by nitriding. Assume the ion-beam current density is 50 
ix A/cm 2 and that the diffusivity of N in Fe is 

D - 0.003exp - 18.2 (kcal/mole)/RT (cm 2 /sec). 


An NH 3 surface source is used for nitriding and the Fe is initially free 
of N. 


8 . a. Qualitatively contrast the rate at which energy is lost from a spherical 
thermal spike of radius L and an energized plate of the same thick¬ 
ness. The spike and plate are initially at the same temperature within 
the same matrix. 

b. The average change in the temperature of a sphere initially at tempera¬ 
ture T t is 


A t) 
r, 


6 


f 1 

2. -2 exp - 
n= 1 n 


n 2 w 2 K( 

~D 


where K is the thermal diffusivity. For the plate, 


At) 

T, 


8 - 1 

2 , -rexp - 

* n= i (2 n + 1) 


(2 n + l) 2 


n 2 ir 2 


Kt 

A 


4 
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To confirm your intuition, plot 

T(t) Kt 

~f~ ys ' Z 2 

for both geometries using values of Kt / L 2 = 0.1, 0.2,. . . ,0.9, 1.0. 

9, Suppose it is desired to selectively harden the flat surface of a steel die 
only in certain areas through ion implantation of 100-keV N ions. Other 
areas are protected against implantation by electrodeposition of a pat¬ 
terned Ni coating that serves to attenuate the ion beam. If specifications 
call for a maximum penetration of 1% of the beam, what coating 
thickness d of Ni is required? Assume Ni and Fe are alike with respect to 
N implantation effects. [Hint: What does 

r 0 _ / ^ - R P \ 2 J 

J d N/27A /?/ XP \V2AR p J dZ 

physically represent?] 

10- In typical SIMOX processing 2 x 10 18 atoms of O are implanted per cm 2 
of Si at 200 keV. After a high-temperature anneal (above 1300 °C for 

o 

6 h) a 4000-A-thick layer of Si0 2 , having sharp parallel interfaces, forms 
buried beneath a 2000-A-thick surface layer of Si. From this information 
estimate values of R p and A R p for O implanted into Si at 200 keV. 
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Emerging Thin-Film 
Materials and Applications 


In this final chapter an attempt is made to present a perspective of some 
emerging thin-film materials and applications that promise to have a significant 
impact on future technology. For this reason the discussion will be limited to 
the following topics: 

14.1. Film-Patterning Techniques 

14.2. Diamond Films 

14.3. High T c Superconductor Films 

14.4. Films for Magnetic Recording 

14.5. Optical Recording 

14.6. Integrated Optics 

14.7. Superlattices 

14.8. Band-Gap Engineering and Quantum Devices 

This potpourri of subjects encompasses covalent, metallic, and semiconduc¬ 
tor film materials deposited by an assortment of PVD and CVD methods. 
Represented are mechanical, electrical, magnetic, and optical properties, whose 
optimization hinges on both processing and the ability to characterize struc¬ 
ture-property relationships. Thus the spirit of materials science of thin 
films—the theme and title of this book—is preserved in microcosm within this 
chapter. For completeness however, it is necessary to start with Section 14.1, 
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which is devoted to the topic of thin-film patterning techniques. This subject is 
crucial to the realization of the intricate lateral geometries and dimensions that 
films must assume in varied applications, particularly some of those in this 
chapter. 


14.1. Film-Patterning Techniques 


14.1.1. Lithography 

Until now the only film dimension considered has been the thickness, which is 
controlled by the growth or deposition process. However, irrespective of 
eventual application, thin films must also be geometrically defined laterally or 
patterned in the film plane. The complexity of patterning processes depends on 
the nature of the film, the feature dimensions, and the spatial tolerance of the 
feature dimensions. For example, consider an evaporated metal film that must 



Figure 14-1. Schematic of the lithographic process for pattern transfer from mask to 
film. Both positive and negative resist behavior is illustrated. 
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possess features 1 mm in size with a tolerance of ±0.05 mm. The desired 
pattern could possibly be machined into a thin sheet stencil or mechanical 
mask. Direct contact between this mask template and substrate ensures genera¬ 
tion of the desired pattern in uncovered regions exposed to the evaporation 
flux. This method is obviously too crude to permit the patterning of features 
100 to 1000 times smaller in size that are employed in integrated circuits. Such 
demanding applications require lithographic techniques. 

The lithographic process shown schematically in Fig. 14-1 consists of four 
steps. 


14.1.1.1. Generation of the Mask . The mask is essentially equivalent to 
the negative in photography. It possesses the desired film geometry patterned 
in Cr or FeO thin films predeposited on a glass or quartz plate. Masks for 
integrated circuit use are generated employing computer-driven electron beams 
to precisely define regions that are either opaque or transparent to light. 
Other processing steps to initially produce the patterned mask film parallel 
those used in subsequent pattern transfer to the involved film. 


14.1.1.2. Printing. Printing of this negative mask requires the physical 
transfer of the pattern to the film surface in question. This is accomplished by 
first spin-coating the film-substrate with a thin photoresist layer (< 1 pm 
thick). As the name implies, photoresists are both sensitive to photons and 
resistant to chemical attack after exposure and development. Photoresists are 
complex photosensitive organic mixtures, usually consisting of a resin, photo¬ 
sensitizer, and solvent. During exposure, light (usually UV) passes through the 
mask and is imaged on the resist surface by appropriate exposure tools or 
printers. Either full-scale or reduced latent images can be produced in the 
photoresist layer. There are two types of photoresists and their behaviors are 
distinguished in Fig. 14-1. The positive photoresist faithfully reproduces the 
(opaque) mask film pattern; in this case light exposure causes scission of 
polymerized chains rendering the resist soluble in the developer. Alternatively, 
negative resists reproduce the transparent portion of the mask pattern because 
photon-induced polymerization leaves a chemically inert resist layer behind. 
For yet greater feature resolution X-ray and electron-beam lithography tech¬ 
niques are practiced. 


14.1.1.3. Etching. After resist exposure and development, the underlying 
film is etched. Wet etching in appropriate solutions dissolves away the exposed 
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film, leaving intact the film protected by resist. Equal rates of lateral and 
vertical material removal (isotropic etching) however, lead to loss of resolution 
due to undercutting of film features. This presents a problem in VLSI 
processing where 1 /xm (or so) features must be defined. For this reason dry 
etching is practiced. Material is removed in this case through exposure to 
reactive plasmas that interact with film atoms to produce volatile by-products 
that are pumped away. For example, typical dry etchants for Si, Si0 2 and A1 
are SF 6 -I- Cl 2 , CF 4 + H 2 , and BC1 3 + Cl 2 gas mixtures, respectively (Ref. 
1). Alternatively, inert-gas plasmas are also employed to erode the film surface 
in a process that resembles the inverse of sputtering deposition. In both cases, 
positive ion bombardment normal to the surface leads to greater vertical than 
horizontal etching, i.e., anisotropic etching. Steep sidewall topography and high 
aspect ratio features such as shown in Fig. 14-2 are the result of anisotropic 
material removal. 

An important issue in dry etching is the etchant selectivity or ability to 
preferentially react with one film species relative to others that are present. 
Simply changing the plasma gas composition can significantly alter etching 
selectivity. For example, the Si0 2 etch rate exceeds that of poly-Si by only 
25% in a pure CF 4 plasma. In an equimolar mixture of H 2 + CF 4 , however, 



Figure 14-2, SEM micrograph of reactive plasma-etched pattern in photoresist re¬ 
vealing development of submicron features. (Courtesy of L. F. Thompson, AT&T Bell 
Laboratories). 
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the etch rate of poly-Si drops almost to zero; the selectivity or ratio of etch rate 
of Si0 2 relative to poly-Si exceeds 45 (Ref. 1). 

14.1.1.4. Resist Removal. The final step requires removal of the resist. 
Special resist stripper solutions or plasmas (e.g., 0 2 rich) are utilized for this 
purpose. What remains is a high fidelity thin-film copy of the mask geometry. 

Only the briefest summary of the basic steps comprising the very important 
technology of lithography has been presented. For more detailed accounts of 
mask production (Ref. 2), photoresists (Ref. 3), printing (Ref. 4), and etching 
(Ref. 1) the reader is referred to the indicated references. 

14.1.2. Silicon Micromachining 

Silicon micromachining can be defined as a high-precision shaping technique 
that uses photolithographic and etching methods to form miniature three-di¬ 
mensional shapes in Si (and Si0 2 ) such as holes, wells, pyramids, grooves, 
hemispheres, needles, etc. In the same way that Si has revolutionized electron¬ 
ics, this versatile material has altered conventional perceptions of miniature 
mechanical components, devices and systems. Though small, micromachined 
features are generally large compared to VLSI dimensions. Examples include 
the microcantilever thin film beams discussed on p. 412, tiny gears, valves, 
springs and tweezers, X-ray Fresnel lenses, pressure and strain transducers, 
ink jet nozzle arrays, electrochemical sensors, multisocket electrical connec¬ 
tors, and force and acceleration transducers (Refs. 5, 6). Among the recent 
developments are the fabrication of a triode vacuum microelectronic device 
(Ref. 7) and an optical microassembly. The former shown in Fig. 14-3a is 
impervious to radiation damage, insensitive to heat with the potential for very 


VACUUM SPACE 



METAL ANODE 
INSULATING LAYER 
METAL GATE OR GRID 
DIELECTRIC (SUCH AS S\O z ) 
METAL EMITTER 
SILICON 

SUBSTRATE 


Figure 14-3a. Schematic structure of Si triode vacuum microelectronic device. 
(From Ref. 7). 



634 


Emerging Thin-Film Materials and Applications 



Figure 14-3b. SEM micrograph of optical microassembly. (Courtesy of K. L. Tai, 
AT&T Bell Laboratories). 

high frequency operation. The latter shown in Fig. 14-3b has been employed to 
provide low-loss coupling between optical fibers and optoelectronic devices in 
optical communications systems. Here the laser (or detector) rests beneath the 
apex of the etched pyramid in which the optical fiber is precisely positioned. 
This microassembly package provides for low-loss electrical interconnection 
between optoelectronic and other electronic devices on a common Si substrate. 

Precise knowledge of etch rate anisotropies and selectivities for Si and Si0 2 
is required for designing successful micromachining etching treatments. In a 
recent study (Ref. 8), utilizing KOH/H 2 0 etchants, the following etch rates 
( R ) were measured as a function of temperature: 

0.61 eV 

/? Si (l00) = 6.19 X 10 8 exp--- (ftm/min), (14-1) 

fC1 

0.77 eV 

fl Si (lll) = 3.19 X 10 9 exp--- (jtm/min), (14-2) 

/C i 

1.07 eV 

R sio 2 = 5.49 X 10 12 exp- —— (ftm/min), (14-3) 
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Figure 14-4. Etching geometry of Si-Si0 2 structure. 


The ratio /? Si (100)//? Si (l 11) defines the etch rate anisotropy M 100/m ) and 
the ratio /? Si (100)//? SiO2 represents the selectivity. 

As an example in the use of these etch rates consider a (100) Si wafer 
containing a 2 /xm thermally grown Si0 2 film so patterned to open windows to 
the Si surface (Fig. 14-4). After etching at 100 °C for 15 min, how much does 
the Si0 2 etch mask overhang the slanted Si wall? During etching, both the 
(100) and (111) planes recede along their direction normals. The angle between 
the [100] and [111] directions is 54.7°. Therefore geometric considerations 
indicate that the net overhang length x at any time t is given by 

x = (/? Si (lll)/sin54.7 - R Si o 2 )t, (14-4) 

where the isotropic etching of Si0 2 is accounted for. Direct substitution of 
/? Si (lll) = 0.126 /xm/min, /? siO2 0.0191 fx m/min, / = 15 min, and sin54.7 
= 0.816, yields x = 2.03 fxm. Depending on the width of the Si0 2 mask 
window, V-shaped pits or flat-bottomed troughs can be etched into Si. 


14.2. Diamond Films 


14.2.1. Introduction 

Derived from the Greek a 5afxa<> ( adamas ), which means unconquerable, 
diamond is indeed an invincible material. In addition to being the most costly 
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on a unit weight basis, and capable of unmatched beauty when polished, 
diamond has a number of other remarkable properties. It is the hardest 
substance known ( H v > 8000 kg/mm 2 ), and has a higher modulus of elasticity 
(E = 1050 GPa) than any other material. When free of impurities, it has one 
of the highest resistivities (p > 10 16 12-cm). It also combines a very high 
thermal conductivity (k = 1100 W/m-K) that exceeds that of Cu and Ag, with 
a low thermal expansion coefficient (a = 1.2 x 10 -6 K _1 ) to yield high 
resistance to thermal shock. Lastly, diamond is very resistant to chemical 
attack. These facts, the first three, in particular, have spurred one of the most 
exciting and competitive quests in the history of materials science—the synthe¬ 
sis of diamond. Success was achieved in 1954 with the General Electric Corp. 
process for producing bulk diamond utilizing extremely high pressures and 
temperatures. Interestingly, however, attempts to produce diamond from low- 
pressure vapors date back at least to 1911 (Ref. 9). P. D. Bridgeman, in a 
1955 Scientific American article, speculated that diamond powders and films 
should be attainable by vapor deposition at low pressures (Ref. 10). By the 
mid-1970s the Russian investigators Deijaguin and Fedeseev had apparently 
grown epitaxial diamond films and whiskers during the pyrolysis of various 
hydrocarbon-hydrogen gas mixtures (Ref. 11). After a decade of relative 
quiet, an explosive worldwide interest in the synthesis of diamond films and in 
their properties erupted, which persists unabated to the present day. 

Isolated C atoms have distinct 2s and 2p atomic orbitals. When these atoms 
condense to form diamond, electronic admixtures occur, resulting in four equal 
hybridized sp 3 molecular orbitals. Each C atom is covalently attached to four 
other atoms in tetragonal bonds 1.54 A long creating the well-known diamond 
cubic structure (Fig. l-2c). Graphite, on the other hand, has a layered 
structure. The C atoms are arranged hexagonally with strong trigonal bonds 
(sp 2 ) and have an interatomic spacing of 1.42 A in the basal plane. A fourth 
electron in the outer shell forms weak van der Waals bonds between planes that 
account for such properties as good electrical conductivity, lubricity, lower 
density, a grayish-black color and softness. 

In addition, C exists in a variety of metastable and amorphous forms that 
have been characterized as degenerate or imperfect graphitic structures. In 
these, the layer planes are disoriented with respect to the common axis and 
overlap each other irregularly. Beyond the short-range graphitic structure, the 
matrix consists of amorphous C. A complex picture now emerges of the 
manifestations of C ranging from amorphous to crystalline forms in a contin¬ 
uum of structural admixtures. Similarly, the proportions of sp 2 -sp 3 (and even 
sp 1 ) bonding is variable causing the different forms to have dramatically 
different properties. Not surprisingly, this broad spectrum of metastable car- 
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bons have been realized in thin-film deposits. What now complicates matters 
further is that the many techniques to produce carbon films use precursor 
hydrocarbon gases. Hydrogen is, therefore, inevitably incorporated, and this 
adds to the complexity of the deposit structure, morphology, and properties. 

Given the structural and chemical diversity of carbon films, an understand¬ 
able confusion has arisen with regard to the description of these materials. 
Labels such as hard carbon, amorphous carbon (a-C), hydrogenated amor¬ 
phous carbon (a-C:H), ion-beam-processed carbon (i-C), diamondlike carbon 
(DLC), as well as diamond have all been used in the recent literature. The 
ensuing discussion will treat the deposition processes and properties of these 
films with the hope of clarifying some of their distinguishing features. 

14.2.2. Film Deposition Processes 

At the outset it is important to realize that synthesis of bulk diamond occurs in 
the diamond stable region of the P-T phase diagram (Fig. 1-11). Thin 
“diamond” films, on the other hand, clearly involve metastable synthesis in 
the low-pressure graphite region of the phase diagram. The possibility of 
synthesizing diamond in this region is based on the small free-energy differ¬ 
ence (500 cal/mole) between diamond and graphite under ambient conditions 
(Ref. 12). Therefore, a finite probability exists that both phases can nucleate 
and grow simultaneously, especially under conditions where kinetic factors 
dominate, such as high energy or supersaturation. In particular, the key is to 
prevent graphite from forming or to remove it preferentially, leaving diamond 
behind. The way this is done practically is to generate a supersaturation or 
superequilibrium of atomic H. The latter can be produced utilizing 0.2-2% 
CH 4 -H 2 mixtures in microwave plasmas or in CVD reactors containing hot 
filaments. Under these conditions, atomic H is generated and, in turn, fosters 
diamond growth either by inhibiting graphite formation, dissolving it if it does 
form, stabilizing sp 3 bonding, or by promoting some combination of these 
factors. In general, hydrocarbon, e.g., CH 4 , C 2 H 2 , decomposition at sub¬ 
strate temperatures of 800-900 °C in the presence of atomic H is conducive to 
diamond growth on nondiamond substrates. Paradoxically the copious amounts 
of atomic H result in very little hydrogen incorporation in the deposit. The 
modem era of CVD synthesis is coincident with the beautiful SEM images of 
diamond crystallites produced in the manner described. These have captured 
the imagination of the world and examples of the small faceted “jewels,” 
grown at high temperatures on nondiamond substrates, are shown in Fig. 14-5. 

The a-C:H materials are formed when hydrocarbons impact relatively 
low-temperature substrates with energies in the range of a few hundred eV. 
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Figure 14-5. Diamond crystals grown by CVD employing combined microwave and 
filament methods. (Courtesy of T. R. Anthony, GE Corporate Research and Develop¬ 
ment). 

Plasma CVD techniques employing rf and dc glow discharges in assorted 
hydrocarbon gas mixtures commonly produce a-C:H deposits. The energetic 
molecular ions disintegrate upon hitting the surface and this explains why the 
resulting film properties are insensitive to the particular hydrocarbon em¬ 
ployed. It is thought that the incident ions undergo rapid neutralization and the 
carbon atoms are inserted into C—H bonds to form acetylenic and olefinic 
polymerlike structures, e.g., C + R—CH 3 -► R—CH=CH 2 , where R is the 
remainder of the hydrocarbon chain. The resultant films, therefore, contain 
variable amounts of hydrogen with H/C ratios ranging anywhere from — 0.2 
to — 0.8 or more. They may be thought of as glassy hydrocarbon ceramics 
and can be even harder than SiC. 

Amorphous carbon (a-C) diamondlike films are prepared at low tempera¬ 
tures in the absence of hydrocarbons by ion-beam or sputter deposition 
techniques. Both essentially involve deposition of carbon under the bombard¬ 
ment of energetic ions. Simple thermal evaporation of carbon will, of course, 
yield highly conductive, soft films that are quite remote in their properties 
from the hard, very resistive, high-energy band-gap diamondlike materials. 
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The ion impact energy, therefore, appears to be critical in establishing the 
structure of the deposit. More diamondlike properties are produced at low 
energy; microciystalline diamond ceases to form when the ion energy exceeds 
~ 100 eV, in which case the amorphous structure prevails. 

An important consideration in the eventual commercialization of deposition 
processes is the growth rate. For both diamond and diamondlike films rates 
generally range from less than 1 up to a few jim per hour. These values should 
be compared with the 10 3 fxm/h rate for the commercial process that produces 
diamond abrasive grain. 


14.2.3. Properties and Applications 

The properties of CVD synthesized diamond, a-C and a-C:H film materials are 
compared with those of bulk diamond and graphite in Table 14-1. Basic 


Table 14-1. Properties of Carbon Materials 


Properly 

CVD 

Diamond 

Thin Films 

a-C 

a-C:H 

Bulk 

Diamond 

Graphite 

Crystal structure 

Cubic 

Amorphous, 

Amorphous, 

Cubic 

Hexagonal 


= 3.561 A 

mixed sp 2 - 

mixed sp 2 - 

a 0 = 3.567 A 

a « 2.47 



sp 3 bonds 

sp 3 bonds 



Form 

Faceted 

Smooth to 

Smooth 

Faceted 



crystals 

rough 


crytals 


Hardness, H v 

3,000-12,000 

1,200-3,000 

900-3,000 

7,000-10,000 


Density 

2.8-3.5 

1.6-2.2 

1.2-2.6 

3.51 

2.26 

Refractive index 

— 

1.5-3.1 

1.6-3.1 

2.42 

2.15 






1.81 

Electrical 

> 10 13 

> 10 10 

ioMo 14 

> 10 16 

0.4 

resistivity (fi-cm) 





0.20 

Thermal 

1100 

— 


2000 

3500 

conductivity 





150 

(W/m-K) 






Chemical stability 

Inert 

Inert 

Inert 

Inert 

Inert 


(inorganic 

(inorganic 

(inorganic 

(inorganic 

(inorganic 


acids) 

acids) 

acids and 

acids) 

acids) 




solvents) 



Hydrogen content 

— 

— 

0.25-1 

— 

- 

(H/C) 






Growth rate 

- 1 

2 

5 

1000 

— 

(M m /h) 




(synthetic) 



From Refs. 12 and 13. 
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differences in structure and properties of diamond and diamondlike films 
ultimately stem from the sp 3 -sp 2 bond concentration ratios. Considerable 
bond admixtures occur in both the a-C:H and a-C films and much experimental 
effort has been expended in determining the bonding proportions. Techniques 
such as Raman spectroscopy, nuclear magnetic resonance, and X-ray photo¬ 
electron spectroscopy (XPS) are used to characterize films and bolster claims 
for the presence of the elusive diamond crystals. Although there is a great deal 
of scatter in many of the film properties due to differing deposition conditions, 
it is clear that the films are extremely hard, chemically inert, and highly 
insulating. 

The attractive attributes of carbon film materials have already been commer¬ 
cially exploited in a number of cases as indicated in Table 14-2. Additional 
applications have been suggested and are the subject of intense current research 
and development activities. For many applications crystalline diamond is not 
essential; diamondlike films will do. With improved film morphology and 


Table 14-2. Actual and Suggested Applications of Diamond and Diamondlike Films 



Application 

Properties Required 

Comments 

1 . 

Resonator diaphragms 
of tweeter loud 
speakers 

High modulus of elasticity 

Frequency response up to 
60,000 Hz possible; 
commercially available 

2. 

Ultrahard tool 
coatings 

High hardness 

Commercially available 

3. 

Sunglass lenses 

High hardness, scratch 
resistance, optical 
transparency 

Commercially available 

4. 

Computer hard disk 
coatings 

High hardness, low wear 

Coatings minimize 
head-disk contact wear 

5. 

Watch cases 

High hardness, scratch 
resistance 


6, 

Prosthetic devices 

High hardness, low wear 


7. 

Optical coatings 

High hardness, high index 
of refraction 


8. 

Infrared laser window 

Transparency to IR 


9. 

Electronic devices— 
traveling wave 
amplifiers 


Heteroepitaxial films 
required 

10 . 

Semiconductor device 

heat sinks 

High thermal conductivity 

Commercially available 

11. 

High-temperature 
semiconductor devices 

Large energy band gap 


12. 

Abrasive grain 

High hardness 
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properties that come with better control of deposition processes, the expanded 
use of these films can certainly be anticipated. 


14.3- High T c Superconductor Thin Films 


14.3.1. Introduction 

The unexpected discovery of high T c superconductivity has fundamentally 
challenged our previous understanding of the subject. Interestingly, critical 
values of temperature, magnetic field, and current density together with the 
Meissner effect still define and limit high-7^ superconductivity. However, 
almost everything about previous theories of superconductivity has been called 
into question, including applicability of such concepts as band gaps, carrier 
pairing, coherence length, etc., to high T c oxides. Therefore, the discussion 
will focus on the composition and structure of these materials, film deposition 
techniques, properties, and thin-film applications. 


14.3.2. Composition and Structure 

The three most actively studied high-7^, superconductors (as of this writing) 
are listed in Table 10-3. YBa 2 Cu 3 0 7 was discovered first, is the easiest to 
prepare in bulk and thin-film form, and has been most extensively investigated. 
A unit cell of this material is shown in Fig. 14-6. The structure is a variation of 
the class of oxygen-defect perovskites involving a tripling of unit cells, 
Perovskites have the property of reversibly absorbing or losing oxygen and are 
therefore nonstoichiometric with respect to this element. Much effort has been 
expended in correlating crystal structure and oxygen content with T c . As the 
oxygen content increases from 6.3 to close to 7 atoms per cell T c is observed 
to increase from 30 to — 90 K. Concurrently both the a and c lattice 
constants decrease, whereas that for b increases—each by approximately 1 % 
(Ref. 15). Current transport is believed to occur along the Cu-0 ribbons (b 
axis). The pyramidal Cu0 2 sheets perpendicular to the c axis reflect the 
layered structure of this as well as other high- T c oxide materials. Tl./ough its 
effect on atomic spacing oxygen necessarily also modifies the valence of Cu as 
well as the Cu—O bond length; increasing O decreases the former and 
increases the latter. Since Cu appears to be an essential ingredient in high T c 
oxides, it has been argued that its valence state and nature of bonding to O 
critically influence superconducting properties. In fact, loss of oxygen with 
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Figure 14-6. Structural model of the unit cell for YBa 2 Cu 3 0 7 . Squares are vacant 
sites. (From Ref. 14 with permission from Kluwer Academic Publishers). 

attendant lowering of T c is a major degradation mechanism in thin films. 
An overall oxygen stoichiometry of very nearly 7 is required for optimal 
properties. 

14.3.3. Film Deposition Techniques 

Among the methods employed to prepare high -T c films are multisource 
evaporation (electron beam and resistance heated), single and multigun sputter¬ 
ing, MBE, pulsed laser (flash) evaporation, MOCVD as well as spin pyrolysis 
and plasma spraying of powders (Ref. 16). Since the vapor pressures of Y, Ba, 
and Cu vary widely they are not amenable to single source evaporation; rather 
three separately controlled elemental sources are used. Films prepared by 
evaporation or sputtering from metallic melts or targets require a subsequent 
high-temperature (e.g., 850-950 °C) oxidation treatment in order to assure 
that requisite levels of O are incorporated. To eliminate this step, in situ 
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growth methods have been developed, utilizing reactive evaporation and 
sputtering, oxygen rf plasmas, microwave generated atomic oxygen and ozone 
production schemes. Regardless of deposition technique substrate heating 
(from 300 to 800 °C) appears to be universal. 

In achieving high-quality films the choice of substrate is critical. Substrates 
must be resistant to high-temperature exposure, degradation in oxidizing 
atmospheres and interdiffusion reactions with deposited films. Furthermore, 
high-7^ epitaxial films require crystalline substrates with small lattice mis¬ 
match and similar thermal expansion coefficients. Substrates employed have 
included A1 2 0 3 (sapphire), MgO, Zr0 2 stabilized with Y, Si, LaGa0 3 , 
NdGa0 3 , and SrTi0 3 . The influence of different substrates on the supercon¬ 
ducting characteristics of e-beam evaporated films is shown in Fig. 14-7; a 
relatively small effect on T c is evident. 



Figure 14-7. Resistance-temperature characteristics of evaporated YBa 2 Cu 3 0 7 films 
on three different substrates. (Courtesy of R. B. Laibowitz, IBM T. J. Watson Research 
Laboratory). 
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14.3.4. Properties and Applications 

Typical resistance-temperature characteristics for YBa 2 Cu 3 0 7 films prepared 
by evaporation, sputtering and MOCVD are shown in Fig. 14-8 where values 
of T c around 90 K are evident. Superconducting transitions as narrow as 0.5 K 
have been achieved together with critical currents in excess of 10 6 A/cm 2 at 77 
K, and greater than 10 7 A/cm 2 at 4 K. Higher current densities than the critical 
value cause the material to become normal. 

One of the troublesome problems in high-7^ superconductors is the very 
short coherence length. Tunneling processes sample states very close to the 
surface as a result. In films of these materials surfaces tend to be rough, 
contain nonsuperconducting cuprates and lose oxygen. These effects adversely 
affect the quality of interfaces in tunnel junctions. 

Low-loss, low-dispersion microwave waveguide coatings appear to be the 
thin-film application closest to being realized. Small electrical resistance at 
high frequency is an essential requirement and high-7^ superconductors have a 
considerably smaller surface resistance than Cu. Problems related to high-tem¬ 
perature deposition and processing of films, lithographic patterning of small 
features, and compatibility with other materials and device structures have 
served to hinder rapid development of microelectronic applications. 



Figure 14-8. Resistance-temperature characteristics of evaporated and sputtered 
YBaCuO films on LaGa0 3 substrates. (Courtesy of R. B. Laibowitz, IBM T. J. Watson 
Research Laboratory). MOCVD results courtesy of B. Gallois. 
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14.4. Films for Magnetic Recording 


14.4.1. Scope (Ref. 17) 

Ferromagnetic thin films already play and will continue to have a major role in 
magnetic recording and storage technology. The needs of both professional and 
consumer audio, video, and computer tapes and disks are currently met by an 
assortment of magnetic particle and thin-film materials. However, the insa¬ 
tiable appetite for data storage continues to push magnetic disk technology to 
ever higher recording densities at lower cost. Currently the storage media 
industry is dominated by the “brown disk’* that contains fine Fe 2 0 3 magnetic 
particles embedded in an organic binder. A basic reason for the use of 
thin-film recording media is greater available signal amplitude relative to 
particulate coatings. The latter are characterized by a linear recording density 
of 10 4 bits per inch of circular track with a track density of 10 3 tracks per 
inch. Thin film media consisting of 1000 A thick electroplated Co-P and 
Co-Ni-P films, already used for computer data storage on rigid risks, offer 
the capability of significantly extending these recording densities. The reason 
is due to the combined effect of 100% packing of magnetic material in 
films—compared with 20-40% in particulate media—and the generally higher 
magnetization possible with Co base alloys. Therefore, the same amount of 
magnetic flux can be contained within a thinner coating enabling the storage 
layer to be closer to the recording head for more efficient recording and 
reading. Importantly, higher storage densities mean greater miniaturization. 
Thus it is that thin-film media usage has largely been driven by the desire to 
reduce the size of personal computers and portable video recording and 
playback systems. 

The basic conversion of the temporal electrical input signals (e.g., linear ac, 
digital, FM, etc.) into spatial magnetic patterns occurs when the storage 
medium translates relative to a recording head as schematically shown in Fig. 
14-9. The medium is either a magnetic tape or flat disk while the head is a 
gapped soft ferrite toroid with windings around the core portion located away 
from the gap. If the input fringe field signal has a frequency / and the medium 
is moving at a relative velocity u, the magnetization pattern will be recorded at 
a fundamental wavelength of X = u//, which is twice the bit length (Ref. 19). 
Video recording at wavelengths of 0.75 fx m represents the highest density 
recording in use today. The spatially varying magnetization pattern in the 
medium produces directly proportional external magnetic fields. When the 
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Figure 14-9. (Above) Longitudinal magnetic recording process; (below) perpendicu¬ 
lar magnetic recording process. (From Ref. 18 © 1985 Annual Reviews Inc.). 


medium is read by passing the recording past a reproduce or read head, these 
fields generate the magnetic flux ($) which circulates through the high-per- 
meability core. By Faraday’s law the flux that threads the windings generates 
the temporal reproduce voltage V 0 : 


d$ d$ 

V 0 = -= — JVu —— 

0 dt dx 


where x = vt and N is the number of reproduce turns. From the foregoing, it 
is apparent that magnetic recording systems require opposite but complemen¬ 
tary magnetic properties, i.e., soft magnetic materials for the recording and 
playback head components and hard magnetic materials for the storage media. 
The magnetic properties of some of these materials are listed in Table 10-4. In 
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the next two sections we further explore their use in magnetic recording 
applications. 

14.4.2. Thin-Film Head Materials (Ref. 17) 

The phenomena of magnetic induction and magnetoresistance are capitalized 
on in the operation of heads. Inductive heads can be used both to record and 
read. High-permeability, soft magnetic materials such as sintered ferrites and 
Sendust (85 Fe-9 Si-5.4 A1 by weight) have traditionally been used in their 
manufacture. To improve performance. Permalloy Films ranging in thickness 
from 2 to 10 /im have been deposited on the yoke structures. Permalloy, a 
favored material for many soft magnetic film applications, has the following 
properties: 4ttM s = 10 kG, H c — 0.5 Oe, permeability = 1500-2000 and 
resistivity - 18 fiQ- cm. Many deposition processes have been employed, 
e.g., electroplating, sputtering (dc, rf, ion beam) and evaporation. Other film 
materials which have been deposited for this purpose include Mu metal, 
Sendust, and Co-Zr-based alloys. Amorphous magnetic glasses such as 
Fe 80 B 20 , Fe 80 B l5 C 5 , and Fe 73 Si 18 C 9 have also been used. They have values 
of 4ttM s in excess of 15 kG with H c less than 1 Oe. 

Magnetoresistance head sensors are read only devices. Again, Permalloy 
films have been used to detect magnetic fields through changes in electrical 
resistivity. In general the fractional change in magnetoresistance (A p/p) 
varies as H 2 . It further depends on cos 2 0, where 6 is the angle between the 

o 

film magnetization and current density vectors. Typically, 1000 A thick 
Permalloy films experience changes in A p/p of a few percent. 

14.4.3. Thin-Film Recording Media 

Two types of recording media can be distinguished, i.e., longitudinal and 
perpendicular (or vertical), depending on whether the magnetization vector 
lies in the film plane or is normal to it. For longitudinal media it is desirable 
that films display square hysteresis loops with M s at least several hundred 
Gauss and H c greater than 500 Oe. 

Magnetic properties, and H c in particular, are influenced by film composi¬ 
tion, thickness, grain size, perfection, impurity content, surface roughness and 
nature of the substrate. These factors in turn depend on the method of 
deposition and on such variables as substrate temperature, deposition angle, 
and magnitude and orientation of applied magnetic fields. Combinations of 
deposition variables must be controlled to yield desired film anisotropies. 

Oblique evaporation and application of external magnetic fields have proven 
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successful in yielding in plane oriented films with desirable magnetic proper¬ 
ties. For example Fe-Co-Cr films are evaporated onto rotating rigid disks by 
evaporation at a 60° angle of incidence. A strong shape anisotropy develops 
with easy axis in the film plane. Self-shadowing of grains is apparently 
responsible. 

One of the limitations of longitudinal media is that magnetization reversals 
along the recording track tend to broaden the transition between neighboring 
magnetized zones. This is due to the demagnetizing effects caused by the 
mutual overlap of repulsive magnetic fields at the transition, an effect that 
essentially limits the achievable linear density of storage. In general the 
maximum packing density is proportional to M r d/H c , where M r is the 
remanent magnetization and d is the film thickness (Ref. 17). Thinner films 
are desired, but this reduces M r and the recording signal, so that trade-offs 
must be struck. Large coercive fields help resist demagnetizing fields and their 
effects. 

Now consider the possibility of perpendicular rather than in-plane 
anisotropy. The magnetization vector is now normal to the film plane and 
points alternatively toward or away from the surface along the track. There are 
no demagnetizing fields at the points of magnetic reversal, thus sharpening the 
transition and increasing the recording density. The discovery that CoCr alloy 
films (15-20 at % Cr) exhibit an easy axis of magnetization normal to the film 
has made the concept of high-density perpendicular recording a reality. In 
these materials the tendency toward in-plane magnetization is countered by 
additional perpendicular crystalline anisotropy. This results in hysteresis loops 
displaying the behavior H c (±) > H c (||) and A/ r (X) > Af r (||), where X and 
|| are the perpendicular and parallel components. Virtually all PVD processes 
have been utilized to deposit CoCr, CoCrX (X = Rh, Pd, Ta), and GdTbFe 
films for potential recording media. Additional essential requirements for these 
materials are corrosion and wear resistance. 

14.4.4. Substrates, Undercoats, and Overcoats 

The implementation of a viable thin-film recording technology necessitates 
consideration of a host of additional materials issues concerned with substrates, 
undercoats, and overcoats. These latter two layers sandwich the magnetic film 
in between. Substrates may be rigid or flexible depending on application. Rigid 
substrates of extremely fine surface finish are used for high-density, rapid 
direct access disk files. They are presently fabricated from an Al-Mg alloy. 
Substrates must be hard and this necessitates an underlayer, usually an 
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Figure 14-10. Schematic arrangement for continuous oblique evaporation of mag¬ 
netic Films (also undercoats and overcoats) onto a continuous web for video tape 
applications. The Co-Ni source is evaporated by an electron beam. (Reprinted with 
permission from IEEE, © 1986 IEEE, from Ref. 20). 

electroless plated Ni deposit that is amorphous and nonmagnetic. Elimination 
of all surface asperities is critical prior to the deposition of glue layers to 
promote adhesion. Next the magnetic films, only a few thousand angstroms 
thick, are deposited. Finally wear-resistant overcoats are required because the 
read-write heads fly over the disc surface at very close proximity and actually 
make contact during stopping and starting. These mechanical interactions cause 
disk and head friction and wear, and even catstrophic head crash. Therefore, 
hard carbon, diamondlike and other hard films have been deposited to mini¬ 
mize these effects. Additionally, solid lubricants are used in conjunction with 
these hard overcoats. 

Tapes and flexible disks are composed of a polymer-polyethylene teraphtha- 
late (PET). In the case of video tape the commercial system for oblique 
deposition of CoNi onto a continuous web of PET is schematically depicted in 
Fig. 14-10. As the tape moves around the drum it passes by an aperture mask 
which controls the range of incident vapor angles intercepted. Higher coercivi- 
ties and squareness ratios result when the tape is moved in the direction of 
decreasing rather than increasing angle. Critical to the development of desir¬ 
able magnetic properties are the conditions for nucleation of a canted columnar 
grain structure. 
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14.5. Optical Recording 


14.5.1. Introduction 

Over the past 15 years various systems for optical recording have been 
developed. The best known are the video disk and the digital audio disk or 
compact disk (CD). Both are intended to play back information stored on the 
disk and therefore employ read only media. The information signal is 
recorded by the manufacturer in the form of micron sized pits on the disk 
surface. A laser beam is employed in the playback process, which is based on 
modulation of the light reflected by the pits (Refs. 21, 22). Electronic signal 
processing then yields the desired video or audio output. 

There are also systems where the user can record information on a disk. 
They rely on a focused laser beam of relatively high power, whose intensity is 
modulated corresponding to the information being recorded. The disk contains 
a film sensitive to the laser light. Upon irradiation, local property changes or 
effects are produced that provide sufficient optical contrast when read out by a 
much weaker laser beam. Laser-film interactions that have been exploited 
include 

1. Formation of holes and pits by melting and flow of polymer materials 

2. Local changes of magnetization in magnetic films subjected to an external 
magnetic field (magneto-optical recording) 

3. Amorphous to crystalline (and vice versa) phase transformation (phase- 
change recording) 

Only the latter two effects will be discussed at any length here. In both, 
laser-film interactions exhibit the important feature of reversibility or erasabil- 
ity. But it is the extremely high storage density capability, made possible by 
the finely focussed laser beam, that is the primary attraction of magneto-optic 
and phase change optical recording. Densities of - 10 8 bits/cm 2 , some 10 
times that of high-performance magnetic disk drives, and 50-100 times the 
density of low-end disk drives has stimulated much interest in erasable optical 
recording for computer data storage applications. The fact that catastrophic 
head-disk crashes are eliminated is an added advantage. Unlike magnetic 
recording where heads contact the disk, lasers are located at least — 1 mm 
away. 

14.5.2. The Magneto-Optical Recording Process (Refs. 23, 24) 

Magneto-optical recording relies on thermomagnetic effects. Information is 
stored in a magnetic film magnetized perpendicular to the surface, e.g., in the 
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Figure 14-11. Schematic diagram illustrating the writing and reading processes in a 
pregrooved multilayer magneto-optic disk. (From Ref. 25 with permission from Else¬ 
vier Sequoia S.A.). 

upward direction. During writing, the modulated linearly polarized laser beam, 
with a diffraction limited diameter of - 1 \x m, impinges on the recording 
material as shown in Fig. 14-11. In Curie-point writing , the film is locally 
heated close to or above the Curie temperature (7" c ), where the net magnetiza¬ 
tion rapidly declines or effectively vanishes, respectively. Under the influence 
of an opposing external magnetic field (//), the direction of magnetization 
reverses relative to that of the nonirradiated neighboring region. This new 
magnetization is frozen in as the material cools to room temperature. Alterna¬ 
tively, in other materials, the magnetization direction can even be switched at 
temperatures far below T c . In this case we speak of compensation-point 
writing , an effect made possible because the coercive field ( H c ) of these 
materials decreases rapidly with temperature. Therefore, as soon as H > H c 
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magnetization reversal occurs. The phenomenon of compensation is exhibited 
by /w/magnetic materials which consist of sublattices or subnetworks of 
antiparallel aligned magnetic moments, each having a different temperature 
dependence of magnetization. These materials, however, have a compensation 
temperature 7^. omp (< T c ) at which the sublattice magnetizations balance. The 
net magnetization then vanishes but H c is very large. Above 7" comp , H c falls. 

After writing there are regions of up and down magnetization in the 
recording track corresponding to, for example, 1 and 0. This information can 
now be read back (Fig. 14-11) using the polar Kerr magneto-optic effect. 
Rotation of the plane of polarization of a linearly polarized light beam after 
reflection from a vertically magnetized magnetic material is the basis of the 
effect. The sense of rotation depends on the magnetization direction in the 
recording film layer. Compared with the writing process, the laser beam 
intensity for reading is much lower. 

Finally the recorded information can be erased by laser irradiation of the 
written domains, but now with H in the direction of the original film 
magnetization. 


14.5.3. Magneto-Optical Film Materials 

Before addressing their actual properties and compositions the issue of why 
films are used deserves brief mention. The primary reasons are the great speed 
of heating and cooling that is possible in films of low thermal mass, and the 
high-storage-density continuous films (rather than particles) afford. Coupled 
with well-developed physical vapor deposition processes that enable economy 
and efficiency of materials utilization (low cost per unit area), thin films are 
universally employed. Desired materials properties include (Refs. 25, 26) 

1. Large value of the intrinsic uniaxial perpendicular anisotropy. 

2. Low T c or 7^ temperatures. During both Curie and compensation point 
recording a temperature of 150 °C is a desirable upper limit. 

3. High H c values, e.g., 1-2 kOe. High H c values ensure domain stability at 
room temperature and absence of growth or shrinkage of domains during 
readout or erasure elsewhere on the disk layer. 

4. A large magneto-optic Kerr effect. 

5. A large saturation magnetization ( M s ). This facilitates writing in weaker 
external magnetic fields and formation of smaller stable domains. 

Alloys of rare earth (RE) and transition metals (TM) are most commonly 
used for magneto-optical recording applications. Thin films of RE-TM alloys 
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are usually deposited in the amorphous state by sputtering and evaporation, 
processes amenable to coating of very large plastic disk substrates. Alloys 
which have been investigated include TbFe ( T c = 80-150 °C), GdFe ( T c = 
180 °C), GdTbFe (T c = 165 °C), GdTbCo (r comp ~ 380 °C), TbFeCo 
(T c = 180 °C) as well as a host of other binary (notably MnBi), ternary and 
quaternary alloys (Ref. 26). Magnetic properties of some of these materials are 
given in Table 10-4. 

14.5.4. Phase-Change Optical Recording (Refs. 22, 27) 

Like magneto-optical recording, phase change optical recording relies on lasers 
to (1) write information into the recording layer, (2) read, and (3) erase 
information—all through heating. Poly crystalline thin films are used as the 
recording layer. During writing localized amorphous regions are created 
within the crystalline surroundings as a result of laser-pulse-induced melting, 
followed by ultrahigh quench rates (see Section 13.2.3). Optical property 
differences between the two phases causes modulation in the reflected light 
during reading, yielding a signal that varies with time. Because the crystalline 
phase is usually both more reflective and more opaque than the amorphous 
material, the disk could be read in either a reflection or transmission playback 
mode. Regions which were amorphized can be made crystalline again by 
heating the recording film to a transformation temperature below the melting 
point. The recrystallization time must be shorter than the laser pass time during 
erasure. Required temperature—time characteristics for the phase-change 
medium are shown in Fig. 14-12. 

Selecting suitable materials for reversible phase-change recording poses a 
considerable challenge because of the unusual combination of properties that 
must be optimized, namely 

1. Fusibility at moderate laser powers (—10 mW) 

2. Sufficient optical contrast on readout 

3. Long time stability of the amorphous phase at room temperature 

4. Rapid recrystallization times (typically 1 //sec at several hundred degrees 
Celsius) 

5. No degradation of performance after numerous recording playback-erase 
cycles. 

Chalcogenide (S, Se, Te containing) film materials have been most exten¬ 
sively explored for reversible phase-change recording. A pure Te recording 
layer will meet many of the required demands but the amorphous state is not 
stable at room temperature. For this reason alloy films have been investigated 
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Figure 14-12. Write-read-erase temperature-time characteristics for amorphous to 
crystalline phase change materials. T m and T x refer to melting and recrystallization. 
(Courtesy of M. Libera). 


and proven more suitable. Compositions studied have included Te-As-Ge, 
Te-Se-S, Te-Se-Sb, In-Sb-Te, and Te-Ge-Sn. Development of this type 
of optical recording has lagged behind magneto-optical recording due to the 
materials limitations already hinted at. 


14.6. Integrated Optics 


14.6.1. Introduction (Refs. 28- 30) 

The term integrated optics was coined by S. E. Miller at Bell Laboratories in 
1969 (Ref. 31) and refers to the integration of components which generate a 
light carrier wave, and then modulate (putting the message on it), switch and 
detect the light. Rather than perform these functions separately with indepen¬ 
dent devices spread out, for example, on an optical bench, the intent is to 
fabricate all of these components in miniature on a small substrate, thus 
eliminating recourse to conversion into electronic signals. The loose analogy to 
electronic integrated circuits is apparent. More efficient and reliable optical 
communications has been the goal driving much of the research and develop¬ 
ment of integrated optical devices and components. One benefit accruing from 
the reduced size and weight is lower electrical power requirements, an 
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important consideration in repeaters for long-distance fiber optic systems. In 
addition, the potential exists for combinations of both optical and solid-state 
electronic devices or integrated circuits sharing a common substrate; advan¬ 
tages include enhanced electro-optical signal processing speed and immunity to 
temperature variation and mechanical vibration. The use of well-established 
microelectronics lithography and processing methods holds the promise of 
low-cost, reliable components for all types of fiber optic communications and 
sensor systems. 

Integrated optics is based on guiding light at optical and near-infrared 
frequencies that are some 10 4 higher than those operational in electronic 
devices. Therefore, the amount of information carried by light signals is 
correspondingly higher. The light is transmitted through waveguides, the basic 
structure of many important optical components. In this sense integrated optics 
is more akin to microwave technology (with tubular waveguides) than conven¬ 
tional geometric optics. The remainder of the section will start with a cursory 
introduction to light propagation in planar waveguides and related components. 
This will be followed by a treatment of relevant thin film materials and 
processing issues involved. 


14.6.2. Light Propagation in Planar Waveguides 


The simplest planar waveguide has the sandwich structure shown in Fig. 
14-13. In the center is a thin film whose thickness is of the order of the 
wavelength of the light propagating through it, i.e., — 1 ^m. The index of 
refraction of this film () exceeds that of either the substrate (n s ) or cover 
layer ( n c ), which serve as the cladding. 

Consider the case where n f > n s > n c . Only Snell's law of refraction, 


sin 4> c ri f sin (f> f n s 

sin <t>f n c sin </> 5 nj ’ 


(14-6) 


coupled with the phenomenon of total internal reflection, is needed to explain, 
by ray-optic methods, the guiding of light in this three layer asymmetric 
waveguide. Referring to Fig. 14-13 we note the following behavior as the 
incident angle of light, <f > s , increases from near zero. At small angles the light 
passes through both the film-substrate and film-cover interfaces. Light exits 
the cover layer in this so called air radiation mode. As <t> s is increased such 
that <j> f is greater than the critical angle for internal reflection at the n f /n c 
interface, the light is partially confined but exits the substrate (substrate 
radiation mode). Finally <j> s increases to the point where <f>j also exceeds the 
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(a) (b) (c) 




Figure 14-13. Optical ray patterns from (a) air radiation mode, (b) substrate radia¬ 
tion mode, (c) guided mode. Wave representation of modes is shown in d. (From Refs. 
28, 29). 

critical angle for total internal reflection at the n f /rt s interface. Now the light 
wave is totally confined as it zigzags forward in a guided mode. In this case the 
critical angle is given by 


<t> f > arc sin ( n s / n f ). (14-7) 

There is another important way of describing the guided mode. In Fig. 
14-13c the motion of each of the two propagating rays has two 
components—one perpendicular to and one parallel to the film axis. Parallel 
components add and account for light propagation along the film waveguide. 
Perpendicular component motions oppose each other and set up a transverse 
standing wave pattern, which is viewed in Fig. 14-13d as sweeping down the 
guide. The modes are labeled by the index m equal to the number of times the 
light intensity goes to zero across the film thickness (tf). Just how many modes 
a given waveguide can support depends on n fJ n s , n c , d and A, the 
wavelength of the light (Ref. 28). The most desirable situation for many 
applications is single mode (m = 0) propagation. This is not difficult to 
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achieve because thin films accommodate fewer modes, and there are practical 
limitations in preparing thick-film waveguides. One aspect of the wave repre¬ 
sentation of modes worth noting is the apparent leakage of light to the sides. 
For the m = 0 mode the wave pattern inside the film is cosine-like; outside 
there is an exponential decay, usually at different rates in each of the cladding 
layers. In fact, readers familiar with the subject will note that mode wave 
patterns resemble the allowable quantum mechanical wave functions for an 
electron confined to a potential well with finite energy barriers (p. 665). 

14.6.3. Integrated Optics —Components and Devices 

Till now only one-dimensional, broad-film-area light confinement has been 
considered. Two-dimensional confinement in strips or channels however, is 
essential for the functioning of most of the components and devices employed 
in integrated optics. Common configurations of raised and embedded channel 
waveguides are shown in Fig. 14-14 together with a pair of components that 
incorporate them. At the outset it is important to distinguish between passive 
and active waveguide components. The channels of Fig. 14-14a, b are passive 
components because their optical properties are not modified by outside 
influences. Other examples include lenses, gratings, and prisms. Active com¬ 
ponents, on the other hand, enable controllable variations in the light ampli¬ 
tude, phase, frequency, polarization, and propagation direction by means of 
externally applied fields. Physical fields typically include electric, acoustic, or 



Figure 14-14. Thin-film waveguides and components: (a) channel and (b) embedded 
waveguides; (c) directional coupler; (d) electro-optical phase modulator 
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magnetic, corresponding to electro-optic, acousto-optic, or magneto-optic ef¬ 
fects, respectively. Under these physical influences active components serve as 
modulators, switches, deflectors, amplifiers, light emitters, and detectors. 

As an example, consider the directional coupler in Fig. 14-14c. Light 
traveling in the channel waveguide can be diverted into the neighboring 
U-shaped guide. The coupling between these closely spaced guides arises from 
the lateral leakage of light energy referred to earlier. “Evanescent” wave 
penetration of the second waveguide occurs along the length common to both 
guides; the light intensity builds and is eventually concentrated in the U guide. 
The coupling between the guides can be made variable either by piezoelectric 
means, or by altering the refractive index of the intervening medium by 
electro-optical methods. In Fig. 14-14d the electro-optic waveguide phase 
modulator is depicted. The waveguide is embedded in an electro-optical crystal 
substrate. When the modulation voltage signal is applied to parallel electrodes, 
the refractive index of either the guide or substrate is altered, influencing the 
phase of the guided wave. 

A different group of devices and components is involved in launching light 
into and out of guides. In the case of GaAs-based integrated optical systems, 
monolithic fabrication (from the same substrate) of coupled laser light sources, 
waveguides, and detectors is possible because of the epitaxial film deposition, 
doping, and lithographic techniques discussed previously in the book. In 
optical systems based on other materials (LiNb0 3 , glass) externally generated 
laser light must be coupled to the thin-film guides. Miniature prisms, thin-film 
diffraction gratings and tapered film couplers are some of the components used 
to achieve these ends. 

14.6.4. Film Materials and Processing 

A sizeable number of different materials have been employed over the years to 
fabricate thin-film waveguides and other integrated optics components. Some 
of these are listed in Table 14-3 together with values for their relevant 
properties. Single-crystal vs. non-single-crystal film structures represent a 
fundamental division in these materials. The former category includes GaAs 
together with other associated lattice-matched compound semiconductors (e.g., 
AlGaAs, InP, GalnAsP etc), LiNb0 3 and LiTa0 3 . These materials have been 
most extensively studied and will be given most of the attention. The remaining 
group of materials is amorphous or polycrystalline in form and consists of 
oxides, glasses, and polymers from which thin dielectric waveguides have been 
oroduced. Studies on such films were useful in the early days of integrated 
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Table 14-3. Materials Employed in Integrated Optics Components and Waveguides 


Material 

Form 

Fabrication 

Method 

Index of 

Refraction 

Loss at 

6328 A 
(dB/cm) 

GaAs 

S.C. 

Epitaxial 

3.6 (at 0.9 ^m) 


Al^Ga, _ a -As 

s.c. 

Epitaxial 

3.6 at x = 0 





2.95 at x = 1 


LiNb0 3 

S.C. 

Ti exchange 

An = 0.13 


(waveguide) 


Ion implant 

An = -0.1 

< 1 



Diffusion 

An = 0.1 


LiTa0 3 

s.c. 

Ag exchange 

An = 0.05 

< 1 

Soda lime 

A 

Ag + exchange 

1.51-1.61 

0.5 

Glass 


K + exchange 

1.51-1.52 

0.1 



Li + exchange 

1.51-1.52 

0.1 

Ta 2 O s 

A 

Sputtering 

2.2 

l 

Si 3 N 4 

A 

Sputtering 

2.0 

0.5 

ZnS 

PC 

Evaporation 

2.3 

5 

Polyurethane 

A 


1.54-1.58 

1 


S.C. = single crystal; A = amorphous; PC = polycrystalline. 
Adapted from Refs. 29 and 32. 


optics, but the role of these materials has diminished as the more versatile 
single-crystal films have become available. 

As noted earlier, well-known compound semiconductor fabrication methods 
enable monolithic integration of light sources (LEDs, lasers), waveguides, and 
detectors (photodiodes and transistors). The ease in varying the refractive 
index (Eq. 7-12) and energy gap (Eq. 7-11) makes Al^Ga^^As and other 
related semiconductors extremely versatile in this regard. Lattice-matched 
epitaxial heterostructures ensures devices of high structural integrity. In addi¬ 
tion to alloying as a method of changing the refractive index, alteration of the 
free-carrier concentration has the same effect. Thus removal of carriers from a 
region will create a higher refractive index relative to the surrounding medium; 
the former can then serve as a waveguide. One way to achieve carrier 
concentration reduction is by proton bombardment. Lastly, in order to produce 
operable semiconductor optical integrated circuits, it is necessary that 

is g (waveguide) > E g (light source) > ^(detector) 

—conditions that are left to the reader to verity. 

Single crystals of LiNb0 3 and LiTa0 3 owe their popularity to the fact that 
they exhibit large electro-optic effects, i.e., electric-field-induced changes in 
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refractive index. This accounts for their use as electro-optic modulators and 
switches. These remarkable materials are also piezoelectric enabling acousto¬ 
optic modulators and beam deflectors to be monolithically integrated within 
optical circuits. Most waveguides are produced either by indiffusion of metal 
(e.g., Ti) or outdiffusion of Li. For example, when LiNb0 3 or LiTa0 3 are 
heated to 1100 °C Li outdiffuses. As a consequence, the refractive index 
increases slightly relative to undiffused regions. Solutions of the form given by 
Eqs. l-27a, b are used to aid in designing heat treatments. Ion implantation of 
atoms such as B, N, O, Ti, and Ag has also been used to fabricate LiNb0 3 
waveguides. Regardless of processing treatment only small differences in 
refractive index (An < 1%) result between lithographically defined regions 
and the substrate; but this is enough for waveguiding. 

14.6.5. An Example— The Integrated-Optics RF Spectrum Analyzer 
(Refs. 28, 33) 

Not only is the rf spectrum analyzer an interesting and significant integrated- 
optics system, it provides an opportunity to briefly introduce a thin film 
surface acoustic wave (SAW) device. The purpose of this spectrum analyzer is 
to enable the pilot of a military aircraft to obtain an instantaneous spectral 
analysis of an incoming (microwave) radar beam. This will enable the pilot to 
determine whether the plane is being tracked by a ground station, enemy plane 
or missile, etc. Presumably, the spectral signatures of enemy radar signals 
would be available for comparison by the onboard computer. The analyzer is 
shown in Fig. 14-15. Light from a laser diode is coupled into a planar 
waveguide lens, and then into a Bragg cell, the heart of the system. The rf 
signals to be analyzed are converted into the operational bandwidth of the 
Bragg cell and then fed into the SAW transducer. The latter consists of an 
interlaced, or interdigitated, comblike pair of electrodes. Acoustic waves of the 
same frequency are generated in the piezoelectric LiNb0 3 medium, where 
alternating rarefactions and condensations cause variations in its refractive 
index. This in effect produces moving diffraction gratings, with periods 
determined by the instantaneous acoustic frequencies. Diffraction of light 
occurs at an angle that is a function of the rf signal. A second lens focuses the 
optical beam onto an array of photodetectors, each element of which represents 
a particular frequency channel. The spectrum analyzer was fabricated on a 
LiNb0 3 substrate measuring 7 X 2.5 X 0.3 cm 3 in which a planar waveguide 
was produced by indiffusion of Ti at 1000 °C. Thus, utilizing a few optical 
components a function is performed that would otherwise require thousands of 
electronic elements. 
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WAVEGUIDE LENSES DIFFRACTED LINEAR 



Figure 14-15. Schematic of the integrated optics rf spectrum analyzer (From Ref. 
33). 


14-7, Superlattices 


14.7.1. Introduction 

Thin-film structures composed of periodically alternating single-crystal film 
layers and known as superlattices have generated much recent excitement in 
both scientific research and technical development circles. While the over¬ 
whelming emphasis here is on semiconductors, other layered materials have 
been employed in superlattices. The term superlattice is borrowed from the 
metallurgy of ordered alloys, e.g., AuCu 3 , where below a critical transforma¬ 
tion temperature a compound-like FCC lattice of Au at cube corners and Cu at 
cube face centers, exists. X-ray diffraction reveals extra or superlattice lines 
superimposed over the normal lines expected from the random FCC solid 
solution. So too, the stacking of alternating film layers in an epitaxial (syn¬ 
thetic) superlattice imposes a film-layer periodicity on top of the regular 
atomic-lattice periodicity within each layer. Unlike homogeneous-ordered al¬ 
loys, however, composite film superlattices are capable of displaying a broad 
spectrum of conventional properties as well as a number of interesting quantum 
effects. The former are due to the frequently synergistic extensions of the laws 
of property mixtures that are operative when both layers are relatively thick. 
Quantum behavior, on the other hand, emerges when the layers are very thin 
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for then the wave functions of charge carriers in adjacent layers penetrate the 
barriers and couple with one another. 

Research on synthesized semiconductor superlattices was pioneered by Esaki 
and Tsu in 1970 (Ref. 34). These investigators were exploring the possibility 
of resonant tunneling, a quantum mechanical effect that arises from the 
interaction of electron waves with double and multiple potential barriers. To 
observe quantum effects the de Broglie wavelength X must be comparable with 
the dimensions over which the electric potential varies. For example, consider 
electrons with an energy of E — 0.1 eV and an effective electron mass 
w* = 0.1w 0 , where m 0 is the free-electron mass. That electrons can have 
masses different from m 0 in solids is a quantum effect. Explicitly, m* is 
inversely proportional to the curvature (second derivative) of the energy- 
momentum (E vs. k) variation (shown schematically in Fig. 7-13) at conduc¬ 
tion-band minima; similarly for holes, maxima in the valence band are 
involved. Since the conduction band of GaAs exhibits a narrower E vs. k 
behavior than that for Si, m * (GaAs) < w* (Si). The actual values are m* 
(GaAs) = 0.07w o and m* (Si) = 0.19m 0 . By the De Broglie relation 

X = h / \Jlm*E , (14-8) 

a value of X = 124 A is obtained for m e = QAm 0 . If the characteristic 
dimensions of the wells or barriers are reduced to less than this electron 
mean-free path, the entire electron system enters the quantum regime; formulas 
will now contain Planck’s constant h. 

The spatial variation of the valence and conduction bands in Fig. 14-16 is an 
instructive way to visualize electronic effects in two important types of 
semiconductor superlattices. In Fig. 14-16a the doping superlattice is seen to 
be composed of alternating P-N layers. The same semiconductor material 
accounts for the constant energy gap which is preserved throughout the 
structure while the different doping levels causes the energy steps at junc¬ 
tions. Standard unbiased P-N diode and NPN or PNP transistor structures 
are represented by similar band models. However, diffusional processing 
limitations broaden the junction region and round the potential barriers. In 
contrast the junctions in MBE-grown superlattices are atomically sharp. A 
second type, the compositional superlattice is shown in Fig. 14-16b and 
consists of layers with different energy gaps. 

In the two decades since their advent a large number of thin-film semicon¬ 
ductor superlattice systems have been studied. Table 14-4 provides a brief sum¬ 
mary of this effort including information on lattice mismatch and method of 
deposition. An indication of the kind of structural quality achievable in super¬ 
lattices can be gauged by considering strained epitaxial layers—a worst-case 
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VALENCE BAND 

Figure 14-16. Band structure model of (a) the doping superlattice, (b) the composi¬ 
tional superlattice, (c) the quantum well consisting of 50 A In 0 53 Ga 0 47 As sandwiched 
between layers of InP. 


Table 14-4. Superlattice Systems 



Film Materials 

Lattice Mismatch 

Deposition Method 

]. 

GaAs-Al A .Ga, x As 

0,16% for x = 1 

MBE, MOCVD 

2. 

In! ^Ga A As-GaSb, _^As v 

0.61% 

MBE 

3. 

GaSb-AlSb 

0.66% 

MBE 

4. 

InP-Ga v In, __ r As^Pj 


MBE 

5. 

InP-In, _ A Ga t As, x = 0.47 

0% 

MBE, MOCVD, LPE 

6. 

GaP-GaP, - x As x 

1.86% 

MOCVD 

7. 

GaAs-GaAs, ^ X P A . 

1.79%, A' - 0.5 

MOCVD, CVD 

8. 

Ge-GaAs 

0.08% 

MBE 

9. 

Si~Sij -jfGe^. 

0.92%, x = 0.22 

MBE, CVD 

10. 

CdTe-HgTe 

0.74% 

MBE 

11. 

MnSe-ZnSe 

4,7% 

MBE 

12. 

PbTe-Pb, v Sn A Te 

0.44%, x = 0.2 

CVD 


From Ref. 35. 
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$eSi 


[ 100 ] 


Figure 14-17, (a) Cross-sectional TEM image of Ge 0 4 Si 0 6 -Si strained-layer super¬ 

lattice. (b) The high-resolution lattice image. (Courtesy of J. C. Bean and R. Hull, 
AT& Bell Laboratories). 

scenario insofar as crystalline match is concerned. The cross-sectional trans¬ 
mission electron micrographs reveal the perfection of the layer regularity (Fig. 
14-17) of the Ge 04 Si 06 -Si superlattice; the lattice image clearly shows the 
absence of interfacial dislocations. Similarly, the superlative lattice image of 
Fig. 7-16 indicates the atomic sharpness of the three-component GaAs-AlAs 
interface grown by MBE techniques. 

14.7.2. Compositional Superlattice Quantum Well 

We now consider the fundamental building block of a compositional superlat¬ 
tice— the quantum well. It consists of two semiconductor barrier layers of 
energy gap E g (b ), flanking the semiconductor well film material whose 
energy gap is E g (w), where E g (b ) > E g (w). As an example, the lattice- 
matched InP-In 053 Ga 047 As-InP quantum well structure is shown in Fig. 
14-16c. Electrons within the conduction band of In 0 53 Ga 047 As are essentially 
trapped in a well much like the celebrated particle in a one-dimensional box. If 
for the moment we assume that the well is infinitely deep, the energy levels 
( E e ) are given by the solution to the Schrodinger wave equation. 

h 2 d 2 \f/(x) 

< i4 - 9 > 

where ^ is the wave function and V is the potential energy of the electron 
(V — 0 in this case). The wave function 

yf/{x) = sin \/%tt 2 m*E e /h 2 x 
vL 




(14-10) 
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satisfies Eq. 14-9, and the requirement that ^ vanish at the well boundaries 
x = 0 and x = L yields the well-known formula for the energy levels. Thus 

E e = h 2 n 2 /im*L 2 , (14-11) 


where L is the well width and n is the quantum number (n = 1, 2, 3 etc.). In 
the present case m* = 0.041 m 0 and for L = 50 A, E e (n = 1) is calculated 
to be 0.367 eV. Similarly, the energy of the hole levels in the valence band can 
be calculated assuming that in In 0 53 Ga 0 47 As, the hole mass, m* = 0.41 m 0 . 
Substituting in Eq. 14-11, with m* h replacing m %, we find E h (n = 1) = 0.036 
eV for the lowest hole level. 

The presence of finite energy barrier heights complicates the problem 
considerably. Electrons are now not so hopelessly confined to the well, but 
have a finite probability of existing outside it. Therefore, within the InP barrier 
regions that flank the well, \p is finite but decreases exponentially with distance 
outward. Within the well ^ varies spatially as sine and cosine functions. The 
boundary conditions require continuity of both ^ and (1 / m*)(d\l/ / dx) at each 
barrier interface. Details of the calculation of energy levels involve an exten¬ 
sion beyond the treatment commonly found in elementary texts on quantum 
mechanics in order to account for differing electron masses in the well (w) and 
barrier (b) regions. For symmetric wave functions in the well corresponding to 
the first, third, fifth, etc. energy levels, E e values are given by the solution to 
(Ref. 36) 



j&w 2 m*(w)E e 

i 

3 

^ * 

1 

V h 2 

[<{b)\ 

V 



(14-12) 


Similarly, for antisymmetrical wave functions corresponding to the second, 
fourth, sixth, etc. energy levels 



1 8ir 2 m*{w)E e 

["CM i 

1/2 f 

\1 h 2 

m*(b) 

V 



(14-13) 


Since these equations have no roots for E e > A E c there are only a finite 
number of discrete energy levels; these can be obtained either numerically or 
graphically. The quantity A E c is the quantum well conduction band barrier 
height, commonly known as the conduction band offset , which for this 
system has been measured to be 0.240 eV. For InP, m*(6) = 0.077m 0 and 
numerical solution of Eq. 14-12 yields E e = 0.087 eV for the first energy 
level. A similar calculation for the first hole level gives E h = 0.024 eV where 
the hole mass for InP = 0.64 m 0 and the well valence-band barrier, A E v 
(which replaces A E c above) is equal to 0.360 eV. The absolute energy of the 
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transition between these two levels is given by the sum of the confined particle 
energies (0.11 eV) plus E g for In 0 53 Ga 047 As (0.75 eV) for a total of 0.86 
eV 1 . 

No longer are the well-known solutions to potential well problems the 
province of academic textbooks on quantum theory. Superlattices have made 
possible the realization of quantum mechanical systems replete with spectra 
consisting of states having well-defined energy levels. As with atoms and 
molecules, either photon absorption or emission (photoluminescence) spectra 
have served to confirm the existence of quantum well electron and hole levels. 
Data on the frequencies where maximum absorption occurs are shown in Fig. 
14-18 for the multiquantum well structure composed of GaAs-AlGaAs layers. 



ENERGY (eV) 

Figure 14-1B. Absorption spectra in GaAs-Al 0 2 Ga 0 8 As layers at 2 K for three 
different well thicknesses. (From Ref. 37). 


fProblem in text due to H. Temkin. 
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The fact that the measured energies depend on the superlattice period spacing 
is clear evidence of the quantum behavior of the system. In photolumines¬ 
cence, light impinging on the quantum wells creates electron-hole pairs. These 
populate the energy levels, and during recombination light emission corre¬ 
sponding to the transition occurs. Charge recombination at lattice defects or 
traps seriously degrades light emission, whose intensity can provide evidence 
for the structural perfection of the quantum wells. 

Similar experiments have been performed on structures containing non-rect- 
angular quantum wells. Lattice matched epitaxial layers of precise composition 
and thickness (superlattices) are required to yield the digitally graded well 
shapes. Alternately analog or continuously graded lattice matched parabolic 
and triangular wells have been recently deposited and the energy transitions 
measured (Ref. 38). Through knowledge of E e > information on band offsets 
was extracted using established quantum mechanical solutions for such well 
geometries. In this way, values of AE c /AE g = 0.65 were obtained for both 
Al 0 3 Ga 0 7 As/GaAs and Al 048 In 0 52 As/Ga 048 In 0 52 As lattice matched to InP, 
where A E g is the energy gap difference between the involved semiconductor 
pairs. 

14.7.3. Modulation Doping 

Now we turn our attention to a superlattice where energy wells are produced 
by selective periodic or “modulation” doping. Free-charge carriers in semi¬ 
conductors are generally created by donor and acceptor impurities. During 
conduction the carriers inevitably undergo scattering from the same impurities 
that brought them into being. In superlattices, however, it is possible to 
spatially separate carriers from their parent impurity atoms. Consider the thin 
layers of wide-band-gap Af-doped AlGaAs alternating with undoped narrow¬ 
band-gap GaAs in Fig. 14-19. Electrons released by donors to the conduction 
band of AlGaAs diffuse and drop into the energy wells of the GaAs. Thus the 
trapped electrons are separated from the donor impurities. If an electric field is 
now applied normal to the figure plane, electron motion is confined in two 
dimensions, i.e., solely within the GaAs layers. But in the absence of 
scattering by impurities the mobility of the electrons will be considerably 
enhanced. Whereas uniformly doped GaAs might have a mobility of several 
thousand cm 2 /V-sec at 4 K, mobilities in excess of 10 6 cm 2 /V-sec have been 
measured in modulation-doped heterostructures. 

If the well widths are now significantly reduced, electron energy levels 
become widely spaced. Available electrons must seek higher levels to occupy 
and may even spill over into the AlGaAs conduction band. Because of the 
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Figure 14-19. Band diagram representation of modulation doped AlGaAs-GaAs 
superlattice. Note separation of electrons from ionized donors. 


lower electron mobility there, the structure exhibits a negative differential 
resistance effect; i.e., the current effectively drops with increasing applied 
voltage. 

In creating semiconductor superlattices nothing less than the controlled 
synthesis of band structures is involved. The subject is known as band-gap 
engineering and promises to usher in novel photodetectors as well as new 
types of quantum devices such as tunneling transistors. This interesting subject 
will be further discussed in Section 14.8. 


14.7.4. Metallic Multilayers and Superlattices 

Judged by scientific publications, research on metallic multilayers and superlat¬ 
tices is only one-tenth that of the effort on semiconductor superlattices. 
Nevertheless this interesting activity spans a broad range of film compositions, 
deposited by both PVD and MBE methods, and is used to investigate a variety 
of scientific phenomena. These are broadly listed together with some specific 
effects, properties and sample systems studied (Ref. 39). 

1. Magnetic Measurements— Magneto-optical effects (Co/Au, Fe/Cu); mag¬ 
netization (Ni/Cu, Gd/Y); Mossbauer effect (Fe/Pd, Fe/V); ferromag¬ 
netic resonance (Ni/Mo, FeCo/Si); neutron diffraction (Dy/Y, Fe/Mg) 

2. Elastic Constants— Au/Ni, Cu/Ni, Mo/Ni, Cu/Al, Cu/Pd 

3. Superconductors— Transition temperature (Nb/Ge, Nb 3 Ge/Nb 3 Ir); criti¬ 
cal field (NbN/AIN, Nb/Ti); critical current (Pb/Bi, PbBi/Cr) 

Compared with semiconductors, metals exhibit far less directional bonding; 
this facilitates the incorporation of defects into growing films and at interfaces 
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between layers. Rather than the coherent ordering of lattice-matched semicon¬ 
ductor superlattices the metallic structures are far less perfect. It is, therefore, 
more appropriate to speak of them as ‘ 4 multilayers.’’ Fortunately, many of the 
properties noted above are less sensitive to crystalline perfection than, for 
example, the electrical response of semiconductor superlattices. 

In closing it is convenient to categorize physical phenomena exhibited by 
multilayer structures, in increasing order of complexity as single film, proxim¬ 
ity, coupling and superlattice effects. Single-film effects are primarily limited 
to the film and independent of the substrate or overlayers. The increase in 
electrical resistivity in metal films as their thickness decreases (see Chapter 10) 
is an example. Proximity effects occur because of a boundary between two 
materials. Physical properties are modified by interfacial effects, e.g., charge 
transfer, strain. Examples include superconducting film-normal conducting 
film or magnetic film-nonmagnetic film combinations. Coupling effects occur 
between two like materials across an intervening unlike film material. Interac¬ 
tion distances can vary from angstroms (magnetism) to tens or hundreds of 
angstroms (superconductivity). In principle, the above three cases do not 
strictly require a multifilm structure; single, double or triple film layers of 
appropriate thickness will do. Superlattice effects (e.g., diffraction), in con¬ 
trast, rely on the collective response of the whole periodic structure. 


14.8. Band-Gap Engineering and Quantum Devices 


14.8.1. Introduction 

Band-gap engineering refers to the synthetic tailoring of conduction and 
valence band edges (band gaps) as a function of position within semiconductor 
structures (Refs. 40, 41). Whether it be single phase semiconductors, hetero¬ 
junctions, quantum wells or superlattices, the intent is to create unusual 
electronic transport or optical effects, and novel quantum as well as nonquan¬ 
tum devices (e.g., lasers, photodiodes, transistors) based on them. There is a 
rich assortment of semiconductor alloys, band gaps, and lattice parameters 
available for this purpose. In addition, other types of quantum devices are 
based on superconductive tunneling and some of these will be introduced at the 
end of this section. 

Two types of electronic devices can be distinguished when film dimensions 
are shrunk. Those in the first category are simply scaled-down versions of 
conventional devices. This process of miniaturization continues to be aggres¬ 
sively pursued in further efforts to increase device packing densities and 
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speeds. Electrons and holes in these devices can still be treated as classical 
particles however. In contrast, the wave nature of the electron is fundamental 
to the operation or understanding of quantum devices. When one or more 
device dimensions become comparable to the de Broglie wavelength the 
quantum regime is entered and new effects emerge. This feature has already 
been discussed in Section 14.7.2 in connection with quantum well behavior. It 
was pointed out there that the magnitude of measurable quantum effects varies 
inversely with film dimensions. 

Quantum devices simultaneously combine great challenges of film deposition 
and fabrication with the indescribable sense of accomplishment in reaching 
fundamental limits (e.g. in speed, sensitivity, selectivity, etc.). Crucial to their 
realization are two often coupled technologies. The first is concerned with the 
controllable deposition of nanometer thick heteroepitaxial film layers with 
suitably engineered energy gaps. This necessarily means tight control of film 
composition and thickness, tasks well suited to MBE methods. Secondly, 
lithographic patterning in the submicron and even nanometer regime is fre¬ 
quently required. This is particularly true if electrons are to be further confined 
to one-dimensional quantum well wires , or even three-dimensional quantum 
dots. Thus the concept of lateral rather than the customary thickness superlat¬ 
tices has emerged. Taken together, terms like band-gap engineering and 
nanofabrication express different facets of this remarkable thin-film activity. 
For discussions of quantum interference and size effects as well as electronic 
devices, Refs. 42 and 43 should be consulted. 

14.8.2. Band-Gap Engineering (Ref. 40) 

14.8.2.1. Graded Gap Transistor. Although no quantum effects are dis¬ 
played in this first example the power of band-gap engineering to improve 
conventional device behavior is illustrated. Consider the bipolar NPN transis¬ 
tor shown unbiased in Fig. 14-20a. A current injected in the base layer controls 
the electron flow from the emitter to collector. In conventional transistors the 
base has a uniform energy gap but no electric field. Therefore, electrons can 
only traverse the base region by diffusion, a relatively slow process. One way 
to speed the transport process is to gradually change the base composition. 
This causes the conduction band to tilt just as if an electric field were present. 
Electrons can now drift more rapidly through the base than in the conventional 
device. Recently, the concept was demonstrated in MBE-grown compound 
semiconductor transistors where the 3000-A base region composition was 
graded to yield an effective base electric field of 6 kV/cm. A factor of 10 
increase in electron velocity was measured. 
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of graded-gap base bipolar transistor; (b) multiplication well avalanche photodiode; (c) 
staircase avalanche photodiode (Reprinted with permission from F. Capasso, Science 
235, 172, 1987). 

14,8.2.2. Avalanche Photodiodes. As a second example consider the 
avalanche photodiode (APD), a P-N junction device used to convert light into 
an electric signal. The response of the APD can be considerably enhanced by 
incorporation of a multi quantum well structure. When an electric field is 
applied in the figure plane, the whole superlattice array tilts by an amount 
proportional to the field strength. As indicated in Fig. 14-20b, photon-induced 
electrons are forced to drift downward while holes correspondingly ascend to 
the top. For large electric fields, electrons in the wide-gap semiconductor 
accelerate to the conduction-band edge discontinuity (AE C ), where they fall 
into the well of the low-gap material. The electrons gain enough energy from 
the large A E c increment to impact with lattice atoms ionizing them. This 
creates more carriers, which in turn impact ionize other atoms, etc., and a 
large avalanche current is produced. The superlattice can be so tailored that the 
valence-band discontinuity AE V is not large enough to supply a similar energy 
boost to the free holes. Thus electrons can be made to impact ionize more 
efficiently than holes simply because AE C > AE V . This effect has been used to 
improve the response of avalanche photodiode detectors by reducing the 
amount of excess noise. 

Rather than alternate high- and low-£^ materials the individual superlattice 
layers can be graded from low to high energy gap and then abruptly narrowed 
at the layer interface. Under a large dc field the conduction band of the 
superlattice unfolds into the staircase shown in Fig. 14-20c. This device is the 
solid-state analog of the photomultiplier with the steps equivalent to dynodes. 
Electrons descending steps can efficiently ionize lattice atoms as they are 
propelled by the steep potential drop (high electric field). 
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Figure 14-21. Resonant tunneling heterostructures: (a) resonant tunneling through 
double barrier; (b) resonant tunneling bipolar transistor operation; (c) resonant tunnel¬ 
ing transistor with parabolic quantum well (Reprinted with permission from F. Ca- 
passo, Science 235, 172, 1987). 

14.6.2.3. Tunneling Devices . The phenomenon of resonant tunneling is a 
consequence of the quantum effects in narrow potential wells and is exploited 
in tunneling devices. In Fig. 14-2la we consider an undoped double barrier 
(e.g., AlAs-GaAs-AlAs) sandwiched between two contact layers. Upon appli¬ 
cation of a dc field, electrons originating at the Fermi level of the left contact 
tunnel through the first barrier into the well. Resonant tunneling occurs when 
the energy of the injected carriers lines up with or becomes equal to one of the 
quantum well levels. Carriers then tunnel through to the right contact with a 
maximum in the overall transmission through the double barrier, and in the 
current-voltage ( J-V ) response. Off resonance, less current flows yielding a 
region of negative resistance in the J- V characteristics. The resonant tunnel¬ 
ing effect is essentially equivalent to the transmitted light enhancement in the 
optical Fabry-Perot interferometer (p. 540). 

In the resonant tunneling transistor (RTT) of Fig. 14-2lb a quantum well is 
grown within the base region of an NPN bipolar transistor structure. The 
voltage between the base-emitter junction is tuned so that emitter electrons 
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resonantly tunnel through one of the well energy levels and are collected by the 
reverse biased base-collector junction. At band edge discontinuities electrons 
are injected ballistically , which means they do not suffer appreciable thermal 
or impurity scattering. In moving through the tunneling barrier electrons pass 
from a region of high potential energy to one of lower potential energy. 
Because tunneling is an elastic process, the total electron energy is unchanged, 
and therefore the kinetic energy is correspondingly increased. Rather than 
carriers that diffuse across the base region in conventional transistors, there is 
a well-directed electron beam of high velocity. 

Another proposed RTT configuration incorporates a parabolic quantum well 
in the base region as shown in Fig. 14-2lc. It is well known in quantum theory 
that the energy levels for a parabolic potential (i.e., the harmonic oscillator) 
are equally spaced. In this way a device with N stable states, where N is the 
number of resonant peaks, can be produced. An TV-state memory, with N as 
high as 8, offers the potential of extremely high density data storage. 

Clearly nothing less than “do it yourself quantum mechanics’’ (Ref. 41) is a 
synonym for band-gap engineering. 

14.8.3. Josephson Tunneling Effects Revisited (Refs. 44, 45) 

Perhaps the most remarkable thin-film quantum devices are those based on 
Josephson tunneling. Consider, for example, the SQUID introduced on p. 485. 
A SQUID can measure voltage differences as small as 10” 18 V, currents as 
small as 10“ 18 A, and magnetic fields less than 10“ 14 T. SQUID sensors are 
also characterized by energy, displacement and acceleration sensitivities of 
approximately 10~ 30 J/Hz, 10' 7 cm, and 10 -9 cm/sec 2 , respectively. When 
operated below the critical superconducting temperature the SQUID sensitivity 
approaches fundamental limits imposed by quantum mechanics. No other 
technology comes close to matching this behavior. To better understand 
SQUIDs, the most widely used superconducting devices, some brief comments 
on Josephson junction tunneling effects are first necessary. 

Josephson predicted that interactions of electric (i) and magnetic (H) fields 
with Cooper pairs (Section 10.5.1), which are obscured in superconductors 
proper, should be observable in “weakly” coupled systems. The term weak 
arises because interactions responsible for Cooper pairs extend across the 
junction in attenuated form. A very narrow constriction between two supercon¬ 
ducting regions is one such link. More importantly, an ultrathin oxide film 
barrier (< 50 A thick) sandwiched between superconducting electrodes could 
also serve this function. A cross-sectional TEM image of such a Josephson 
junction structure is shown in Fig. 14-22. The barrier is so thin as to be 
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unresolvable between the PbSb and PblnAu electrodes. When the current 
density (J) is increased from zero, Cooper pairs tunnel through the barrier 
establishing a phase shift (5) between quantum mechanical wave functions in 
either superconductor. The resulting current is 

J = 7 0 sin 6, (14-14) 

where J 0 is the critical junction current. In the Josephson region (J < J 0 ) the 
voltage (V) across the junction is zero (Fig. 10-15d), but for J > J 0 a voltage 
develops. With time (/) the phase difference increases, a phenomenon known 
as the ac Josephson effect. It is characterized by the equation 

dd 4irgV 

~dt ~ h 

where q and h are the electronic charge and Planck's constant, respectively. 

The most unusual Josephson junction effects are associated with magnetic 
fields. There is a nonmonotonic dependence of J on H. As H increases, J 
falls to zero, then rises again, and drops again, etc. The oscillatory J-H 
behavior is the same as that which describes the light wave (Fraunhofer) 
diffraction pattern through a narrow slit. When magnetic flux <f> threads a 
superconducting ring it is quantized with a value of n<b 0 , where n is an 
integer and 4> 0 , the flux quantum, is given by h /2q. The latter has a value of 
2.068 x 10“ 15 Wb or equivalently 2.068 X 10“ 7 gauss-cm 2 . Furthermore, if 
there is a Josephson junction in the superconducting loop, the H fields cause 
interference effects with currents in different parts of the multiply connected 
superconducting circuit. The supercurrent that flows then varies periodically as 
sin2 7r4>/4> 0 , a fact relied on in the measurement of magnetic fields. 

14.8.4. SQUIDs and Their Applications (Refs. 46, 47) 

SQUIDs combine the phenomena of flux quantization and Josephson tunneling. 
Two types may be distinguished as shown in Fig. 14-23, and they consist of 
either one (rf) or two (dc) Josephson junctions inserted into a loop of 
superconducting material. Of the two, the dc SQUID is the more highly 
developed and sensitive one, but the rf SQUID is widely used and has been 
commercially available for nearly two decades. Both are flux-to-voltage trans¬ 
ducers producing an output voltage in response to an applied magnetic flux. In 
the dc case the device is biased with a constant current and the voltage output 
oscillates with a period 4> 0 . A feedback circuit linearizes the response enabling 
detection of less than one flux quantum. The rf SQUID loop is inductively 
coupled to the inductor of a resonant circuit excited at its resonant frequency. 


(14-15) 
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Vrf 


DC SQUID RF SQUID 

Figure 14-23. Dc SQUID consisting of two Josephson tunnel junctions connected in 
parallel in a superconducting loop of inductance L. Each junction is shunted with a 
capacitor C and resistor R to eliminate hysteresis in J- V characteristics. Rf SQUID 
contains one Josephson junction, (From Ref. 44). 

As 4> changes, the amplitude of the rf voltage ( V rf ) is modulated with a per¬ 
iod 4> 0 . 

Both types of SQUIDs fabricated from low-T c superconductors operated at 4 
K have been routinely used as ultrasensitive magnetometers, voltmeters and 
motion detectors. Physicists employ them to search for exotic entities such as 
monopoles and quarks, and geologists prospect for oil and mineral deposits 
with SQUIDs. Biomagnetic applications such as magneto-encephalography are 
among the most interesting. Since the H field in and around the human brain 
is very small and flux densities never exceed 10 ” 12 T, only SQUIDs have the 
required measurement sensitivity, (For comparison the earth’s magnetic field is 
5 x 10 5 T.) Similarly, the electromagnetic activity in the heart has been 
probed this way. For these purposes SQUIDs composed of NbN films sepa¬ 
rated by an ultra thin MgO tunnel barrier have been employed (Ref. 48). These 
devices have lateral resolutions of a few millimeters. Because magnetic signals 
typically fall off as the fourth power of the distance from the source, pickup 
coils must be as close to the body as possible. 

In addition to magnetometry the other major application of SQUIDs has 
been in computer technology (Refs. 49, 50). The reason is that the Josephson 
junction can act as a switch and thus steer signals into desired circuits. 
Josephson junctions are the fastest switches known, capable of changing states 
in as little as 6 ps. The operation can be understood through reference to Fig. 
14-24. One side of the junction (counterelectrode) is connected through a 
resistor to a current source and also to an output device. The other side (base 
electrode) is connected to the electrical ground. If the supply current (I g ) is 
below the critical value, the voltage drop across the junction is zero and there 
is a direct short-circuit to ground. If, however, the control line current ( I c ) is 
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Figure 14-24. Three interconnected superconducting Josephson junctions. Magnetic 
flux threads the loop or ring formed by a pair of junctions. (Reprinted with permission 
from Ref. 50, © 1979 IEEE). 

large enough to produce a magnetic field that penetrates the junction layer, the 
latter switches to the nonsuperconducting resistive state. In effect, the current 
switches from branch 1 to branch 2 in Fig. 10-15d. As a result, voltage 
appears across the junction and current is channeled into the output line. 
Because these junction devices are superconducting the power dissipation is 
extremely small—perhaps a few microwatts. Therefore a few million tightly 
packed Josephson junction circuits would only generate several wans, an 
amount which can be readily dissipated. This in contrast to present semicon¬ 
ductor integrated circuits, where removal of Joule heating is a very major 
concern. 


14.8.5. Materials Problems in Josephson Tunneling Devices 

The remarkable benefits of Josephson junction devices do not come easily. For 
example, there are formidable difficulties in fabricating tunneling barriers and 
maintaining their integrity during operation. Theory predicts and experiments 
have confirmed that a 4-A change in a 40-A-thick barrier film causes the 
Josephson threshold current, an important circuit design parameter, to vary by 
a factor of 10 (Ref. 50). Native oxides, usually employed as tunnel barriers, 
are generally grown by rf sputtering in a low-pressure 0 2 atmosphere. In this 
environment the oxide simultaneously grows and sputters. The oxidation rate 
decreases with oxide thickness, but the sputter rate remains constant. There¬ 
fore, with carefully controlled oxygen pressure, rf power, and substrate 
temperature, a predetermined terminal oxide thickness will form. At this stage 
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oxidation and sputter rates are equal and the oxide thickness is self-limiting. 
After the terminal oxide is established, new oxide continually sputters off. The 
result is removal of contaminants and uniform thickness on all base electrodes. 

A further materials problem aired in Section 9.5.3 concerns barrier mechan¬ 
ical stability during thermal cycling. The use of Sb in the Pb electrode (Fig. 
14-22) is to harden it in order to prevent hillock formation. 

Finally, it must have occurred to the reader, well before this point, that 
high~T c Josephson junction devices should offer significant benefits relative to 
low- T c devices. Unfortunately very small coherence lengths and unstable 
surfaces coupled with fabrication and lithography difficulties has hindered 
progress. Nevertheless, Josephson tunneling across grain boundaries, serving 
as weak links, has been reported (Ref. 51). High-7^ Josephson devices will 
probably not approach the sensitivities reached by low-7^ devices because of 
thermal noise in measurement. 


14.9. Conclusion 


This appears to be an opportune time to end the book or, more truthfully, to 
abandon it. Much remains to be said about other thin-film applications includ¬ 
ing displays, sensors, and assorted electronic, optical, thermal, and surface 
acoustic devices. The book by Chopra and Kaur (Ref. 52) is recommended for 
these and other applications. 



Exercises 



1. A (100) Si wafer is covered with a Si0 2 mask film. A square window in 
the Si0 2 is opened to the bare Si surface by photolithography methods. If 
the sides of the square lie along [110] directions, sketch the shapes of the 
holes produced in the Si after etching in K0H-H 2 0 if 

a. etching is isotropic. 

b. etching is anisotropic (Note: Anisotropic etchants create faceted holes 
composed of crystal planes that are etched slowest.) 

c. the initial square mask window is widened and the etching time is 
reduced. 

d. What shapes are produced if the initial window has a rectangular 
shape? 
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2. The K0H-H 2 0 anisotropic etchant attacks lightly doped Si much more 
rapidly than heavily doped Si. Design a processing schedule involving 
doping to produce large area membranes of arbitrary thickness in Si 
wafers. One application is X-ray masks (see p. 410). 

3. Three thin-film computer-memory schemes (magnetic film, magnetic 
bubble and superconducting) have been considered in this book. Com¬ 
ment on the materials properties issues and technical difficulties that have 
presently rendered these schemes “all but a memory. “ 

4- Estimate the maximum thickness of a CVD diamond film that can grow 
epitaxially on bulk diamond without generating misfit dislocations. 

5. Consider the web coating of a polymer film of thickness, d w , by a 
magnetic alloy of thickness, d m , in the manufacture of magnetic tape. 
The alloy film is sputtered and the thermal power flux density delivered 
to the substrate is P . This heat is removed by the chill roll and the 
resulting temperature history of the web is given by (see Eq. 3-34) 

dT 

P cd w — = P - h(T - T r ). 
dt 

p = web density 
c - web heat capacity 
T r - chill roll temperature 

h = heat transfer coefficient between web and chill roll 

a. Derive an expression for the web temperature if it moves at velocity v 
and a length L is exposed to the depositing atoms at any given instant. 

b. At what velocity must the web travel if d m = 1000 A, d w = 0.05 
mm, T r = 20 °C, and L = 40 cm. Assume the maximum permissible 
web temperature is 100 °C, p = 1.4 g/cm 3 , c = 0.2 cal/g-°C, and 
that h = 9 X 10“ 3 cal/cm 2 -sec-°C. 

The energy per depositing atom is 20 eV. 

6. a. Estimate the substrate temperature that would be required for MBE 

growth of diamond films. 

b. Diamond has an extraordinarily high surface energy. What are the 
prospects for heteroepitaxial growth of diamond films? 

7. Consider a film medium used for magnetic recording that is perpendicu¬ 
larly magnetized with parallel stripe domains separated by 180° bound¬ 
aries. Both magnetostatic (E m ) as well as domain wall energy {E vv ) 
contribute to the total energy ( E r ). If the film thickness if c/, the distance 
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between walls is d w , and 

E m = 1.07 X \0 5 M^d w (in mks units). 

a. Show that 

d . . 

E t = E w — + 1.07 X \O s M?d w . 
d w 

b. What is the equilibrium value of d w ? 

c. If M s = 0.05 T and £,= 1.7x 10" 3 J/m 2 , what is d w if d = 2000 
A? 

8 . By directly counting all Y, Ba, Cu, and O atoms in the unit cell depicted 
in Fig. 14-6, demonstrate that the crystal stoichiometry corresponds to the 
formula YBa 2 Cu 3 0 7 . 

9- Assume that a quantum well infrared detector device based on a semicon¬ 
ductor with E g = 0.9 eV can be simply modeled by an infinite well 
potential. The effective masses of conduction-band electrons and 
valence-band holes are 0.06 m 0 and 0.42 m Q , respectively. 

a. In order to tune operation to 1.0-eV radiation, what well width is 
required? 

b. For a 1.0-eV transition solely between conduction-band electron levels 
how thin must the well be? 


10. The probability P that electrons launched with kinetic energy E toward a 
rectangular barrier (of potential energy V and thickness d) will penetrate 
it by quantum mechanical tunneling is given by 


P(E) = 


1 H-t --sinh 2 

4 E(V- E) 


87 r 2 m. 


'-(V - E) x/2 d 


a. Suppose V - 0.1 eV, E = 0.05 eV, and d = 40 A. Evaluate P. 

b. For a 10% change in d by what factor does P change? 

c. Is the tunneling probability enhanced more by decreasing V or increas¬ 
ing E by the same amount of energy. 


11 . Predict values of A E c and AE V for the following rectangular quantum 
well structures: 


a. Al 0 3 Ga 0 7 As/GaAs/AlQ 3 G3 q 7 As 

b. Al 0 48 In 0 52 As/Ga 0 48 In 0 52 As/Al 0 48 In 0 52 As lattice matched to InP. 
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12- In the Al 0 35 Ga 0 65 As/GaAs/Al 0 35 Ga 0 65 As rectangular quantum well, 
AE C = 0.33 eV, AE V = 0.18 eV, m*(w) = 0.067 m Q , m%w) = 
0.34w o , m*(6) = 0.095/7 ? o and /n*(6) = 0.36m 0 . If the well width is 
150A, 

a. determine the first two electron and hole levels. 

b. indicate four different electron-hole transitions that could be measured 
with a spectrometer. 

c. What are the energies (in eV) of these transitions? 
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Physical Constants 


CONSTANT 

SYMBOL 

VALUE 

Angstrom 

0 

A 

1A - 10” 8 cm = 10” 1 nm 

Avogadro constant 

n a 

6.022 x 10 23 particles/mole 

Boltzmann constant 

k 

1.38 X 10“ 23 J/°K 



= 8.617 X 10 -5 eV/°K 

Gas constant 

R 

1.987 cal/mole °K 

Electronic charge 

q 

1.602 x 10 -19 coulomb 

Electron mass 

m c 

9.11 x 10“ 28 g 

Permittivity in vacuum 


8.85 x 10” 14 F/cm 

Planck constant 

h 

6.626 x 10 34 Joule-sec 

Speed of light 

c 

2.998 x 10 10 cm/sec 


685 







1 ii u 


Appendix 2 -Ee 


H] 


Selected Conversions 


1 Atm = 1.013 x 10 6 dynes/cm 2 = 1.013 x 10 5 Pa = 1.013 x 10 5 N/m 2 

1 Torr = 1 mm Hg = 133.3 Pa 

1 Bar = 0.987 Atm = 750 Torr 

10 10 dynes/cm 2 = 10 9 N/m 2 = 10 9 Pa = 146000 psi 

1 erg = 1 dyne-cm = 10“ 7 Joule 

1 eV = 1.602 x 10“ t2 erg = 1.602 x 10" 19 Joule 

1 eV/atom = 23060 cal/mole 

1 Weber/m 2 = 10 4 Gauss = 1 Tesla 

1 Poise (P) = 1 dyne sec/cm 2 
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A 

Abrasive wear, 573-575 
Abrupt interface, 440 
Absorbing coatings, 538 
Absorbing films, 530 
Absorption, 541 
optical, 509 

Absorption coefficient, 324 
semiconductor, 327 

Absorption spectrum, GaAs-AlGaAs, 666 
Accumulation, 475 

Activated reactive evaporation (ARE), 134, see 
also ARE 

Activation energy, CVD reactions, 173 
Adhesion, 439, 440, 446 
scratch test, 445 
shear test, 445 
tensile test, 445 
tests, 443 
theories, 442 
work, 440 

Adhesive tape test, 444 
Adhesive wear, 573 
Adiabatic heating, 599 
Adsorption, 212 
AES, 250, 275, 278, 283 
Au-Pd diffusion, 366 


GaAs-AlGaAs, 287 
nucleation and growth, 220-221 
principal lines, 281 
quantification, 285 
Si, 251 
Ag 

nucleation, 207 
nucleation, growth, 196 
optical constants, 511 
Ag-Au, interdiffusion, 375 
Ag-Au-Pb, x-ray diffraction, 274 
Ag-U, residual stress, 415 
Al, 598 

electromigration, 380 
etching, 632 
microfractograph, 230 
optical constants, 511 
Al-Au, 376 

interdiffusion, 377 
Al-Cu, 384 

contact reactions, 387, 388 
evaporation, 86 
Al-Si, phase diagram, 30 
A1 2 0 3 , 547, 554, 559, 560, 576 
conduction, 471 
fracture, 569 
free energy, 24 
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Index 


AlAs, properties, 325 
AlGaAs, 327, 658, 659 
MBE, 339 

refractive index, 329 
AlGaAs-GaAs, laser, 323 
Alloy films, 374 
AlSb, properties, 325 
Aluminum, vapor pressure, 82 
Aluminizing, 580, 581 
Amorphous 
carbon, 637 
film, resistivity, 238 
films, 653 

films, Au-Co, Ni-Zr, 236 

phase, 609 

Si, 602 

solid, 8 

thin films, 235 

zone, 612 

Amorphous-ciystalline, optical recording, 654 
Analog grading, 667 
Analytical techniques (table), 250 
Anisotropy, 
magnetic, 486-487 
perpendicular, 498 
Anisotropy field, 488 
Annealing, laser, 595 
Anode sheath, 107 

Antireflection (AR) coating, 507, 528, 532-533 
broadband, 529 
double layer, 529 
single layer, 529 

ARE, compounds synthesized by, 136 see also 
activated reactive evaporation 
Arrhenius plot, 378 
As stabilized growth, 342 
Atomistic theory, nucleation, 206 
Au 

epitaxy, 222 
in Si, 604 
microporosity, 234 
optical constants, 511 
self diffusion, 364 
sintering, 216 
stress-strain, 408 
Au-Co, 236, 617 
resistivity, 238 
Au-Ni, surface alloying, 609 
Au-Pd, diffusion, 366 
Auger electron spectoscopy, see AES 


Autoepitaxy, 200 
Avalanche photodiodes, 671 

B 

Ball bearings, 577-578 
Ballistic transport, 673 
Band gap engineering, 668-670 
Barrier height, 393, 468 
Barrier limited conduction, 466 
Barrier oxide, 677 
BC, 547 

BCS theory, 482 
Beamsplitter, 532 
Bending, 418 
Bias sputtering, 129 
Biaxial stress, 418, 423-424 
Bilayer metals, ion beam mixing, 618 
Bloch wall, 494 
energy, 495 
BN, CVD, 150 
Bond coat, 584 
Bonding, 14 
covalent, 18 
ionic, 17 
metallic, 16 
van der Waals, 19 
Borides, 551 
Boronizing, 580 
Borophosphosilicate glass, 177 
Boundary layer, 163, 165 
Bragg’s law, 7, 343-344 
Bravais lattice, 2-3 
Broadband filter, 539 
Bubbles, magnetic, 499, 501 
Bulge testing, 410 
Bulk diffusion, 224 
Bulk limited conduction, 467 
Burger’s vector, 11 

c 

C-V characteristic, 476 
Cantilever beam, stress measurement, 421 
Capillarity, 198 
Carbides, 551 
alloy, 558 
Carbon, 

amorphous, 637 
ion beam, 637 
Carburizing, 580-581 
CARIS, 253, 258 
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Cascade, ion, 612 
Cathode glow, 105 
Cathode sheath, 107 
CdS, 513 
properties, 325 
CdTe, 

CSVT, 165 
properties, 325 
Characteristic x-rays, 278 
of elements, 281 
Chemical bonding, XPS, 287 
Chemical characterization, 275-300 
detection limit, 276 
resolution, 276 
sensitivity, 276 
table, 276 

Chemical equilibrium, 40 
Chemical reaction, thermodynamics, 22 
Chemical reaction rate, 39 
Chemical vapor deposition (CVD), see CVD 
Chemisorption, 340 
Chromizing, 580 
Clausius-Clapeyron, 82 
Close-space vapor transport (CSVT), 165 
Cluster, nucleation, 206 
Cluster migration, 214, 217 
surface migration, 218 
Co-Cr, 648 

Coalescence, 196, 213, 217 
stress, 430 
Coatings, 549 
ball bearing, 577 
fracture, 566 
wear, 576 
CoCrAly, 547 
Coefficient of friction, 572 
Coercive field, 488 
Coercive force, 489 
Cohesive energy, 561 
Cold mirror, 536 
Cold wall reactor, 175 
Si, 179 

Collision cascade, 612 
Collision diameter, gases, 52 
Columnar grain, 230, 649 
microstructure, 228 
TEM, 229 

Compensation point recording, 651 
Complementary error function (Erfc), 35 
Compositional superlattice, 662, 664 
Compound formation, 367 
CVD, 150 


growth kinetics, 368 
in Au-Al, 379 
Ni-Si, 391 

Compound interface, 440 
Compound semiconductor, 322 
energy gap, 328 
lattice constant, 328 
materials, 324 
properties, 325 
table, 325 

Compounds, AuA1 2 -Au 2 A1, 377 
Computer memory, 488 
magnetic bubble, 501 
SQUID, 675 
thin film, 500-501 
Computer simulation, 
film morphology, 230 
microstructure, 232 
Condensation, 
complete, 210 
extreme incomplete, 213 
incomplete, 210 
Conductance, 54 
cold trap assembly, 59 
orifice, 57 
various shapes, 58 
Conduction, 
barrier limited, 465 
bulk limited, 465 
insulators, 465 
intrinsic, 466 
ionic, 466 
mechanisms, 466 
metals, 456 
Poole-Frenkel, 466 
Schottky emission, 466 
Si 3 N 4 , 470 

space charge limited, 466 
tunneling, 466 
Conduction band offset, 665 
Cone formation, 621-622 
Conformal coverage, 93 
Constitutional supercooling, 606 
Contact resistance, 473 
Al-Si, 474 
Pt Si - Si, 474 
Contacts, 385, 472 
transistor, 386 

Contamination, effect on nucleation, 223 
Convection, 167 
Cooper pairs, 483 
Copper, structure, 3 
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CoSi 2 

resistivity, 462 
TEM image, 7 
Covalent bonding, 18 
Covalent compounds, hard, 551 
Cr-Cu-Au, 372 
Creep, 435 
diffusional, 438 
Critical magnetic field, 480 
superconducting materials, 481 
Critical nucleus, 200 
Critical temperature, 480 
superconducting materials, 481 
Critical wave length, 514 
Cross-tie wall, 494 
energy, 495 

Crucible source, evaporation, 98 
Cryopump, 68 
schematic, 69 

Crystal oscillator, 253, 263 
film thickness, 264 
Crystal structure, 

Bravais lattice, 3 
Diamond cubic, 4 
rock salt, 4 
zinc blende, 4 

CSVT, materials synthesized by, 166 
Cu, 

nucleation effects, 205 
optical constants, 511 
Cu-Ni, diffusion, 356 
Curie point writing, 651 
Current-voltage characteristics, tunneling, 484 
Cutting tools, 548 
CVD, 147, 154 
ball bearings, 579 
coatings, 550 
compound formation, 150 
CSVT, 165 
diamond, 637-638 
diffusion control, 172 
ECR discharge, 184 
equilibrium conditions, 157 
film growth rate, 168-169 
film structure, 175 
film thickness uniformity, 170 
gas transport, 162 
growth kinetics, 167, 171 
hard coating, 554, 557 
hazardous gases, 189 
high temperature, 178 
hydride process, 153 


LECVD, 185 
low temperature, 177 
multilayer coatings, 559 
oxidation, 150 
poly-Si, 176 
pyrolysis, 149 
rapid thermal, 335 
reactor, 178, 555 
reduction, 149 
safety, 188 
SiC, 161 

silicon nitride, 182 
superlattice, 663 
surface reaction control, 172 
thermodynamics, 155, 173 
YBa 2 Cu 3 0 7 , 644 

D 

de Broglie, 662, 670 
Debonding, 569 
Defectless flow, 434 
Defects in solids, 10 
Deformation map, 436 
Delamination, 569 
wear, 573 

Demagnetizing field, 486 
Density, 232, 233 
Depletion, 475 
Deposition rate, 95 
nucleation, 203 
Desorption, 201, 340 
Diamond, 27, 547, 635-636, 638, 640 
properties, 325 
table, 639 

Diamond films, 638 
applications, 640 
properties, 639 

Diamondlike, carbon, 637, 640 
Dielectric breakdown, 477-479 
Dielectric isolation, 333 
Dielectric mirror, 540 
Dielectric stack, 534 
Dies, 620 

Diffraction, optical simulation, 239 
Diffusion, 33, 164, 355, 580 
alloy films, 374 
atomistic, 36 
Au in Au, 364-365 
compounds, 367 
Cr-Cu-Au, 373 
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creep, 435, 438 

dislocation, 358 

free energy gradient, 38 

grain boundary, 358, 361, 363, 374 

interface, 441 

Kirkendall effect, 369 

lattice, 358 

mechanisms, 357-359 
miscible systems, 366 
non-steady-state, 34 
permeation, 374 
size effect, 370 
solder joint, 372 
surface, 361 

surface accumulation, 374 
Diffusion barrier, 389, 392, 395 
sacrificial, 394 

Diffusion barriers, for Al-Si reactions, 393 
Diffusion coatings, 580 
Diffusion coefficient, 33-34, 37, 358 
alkali halides, 208 
GaAs, 360 
metals, 208 
semiconductors, 208 
Si, 360 

Diffusion control, 172 
Diffusion length, 355 
Diffusion pump, 64, 66 
schematic, 65 
Digital grading, 667 
Direct energy band gap, 324, 326 
Directional coupler, 657 
Discharge species, 105 
Dislocation, 
diffusion, 358 
edge, screw, 11 
Dislocation climb, 435 
Dislocation density, 12 
Dislocation dynamics, 409 
Dislocation energy, 316 
Dislocation glide, 434, 437 
Dispersion, 541 
Dispersion relations, 515 
Disproportionation, 151 
Ge, 190 
Si, 192 

Dissociation, 108 
Dissociative attachment, 109 
Dissociative ionization, 109 
Domain, 495 
single, 497 

Domains, in garnet films, 499 


Doping superlattice, 662 
Dose, 612, 614 
to amorphize, 613 
Double crystal diffractometer, 348 
Dynamic SIMS, 299 

E 

Edge dislocation, motion, 12 

EDX, see Energy dispersive x-ray analysis 

Effusion cell, 336 

Electrical conduction, insulators, 465 
Elastic collisions, 107 
Elastic strain energy, 316 
Elasticity, 405 
Electrical properties, 451 

Dependence on electrodes, 453 
Dependence on film preparation, 452 
Electro-optical phase modulator, 657 
Electromigration, 379, 383-384 
damage, 382 
film life, 381 
Electron, 
energy bands, 19 
energy levels, 14 
free, 16 

mean free path, 457 
Electron attachment, 109 
Electron beam evaporation, 99 
Electron beam induced current (EBIC), 269 
Electron beam source, 100 
Electron current, plasma, 106 
Electron cyclotron resonance (ECR), 184 
Electron microprobe, 283 
Electron motion, 

in magnetic and electric fields, 124 
magnetic and electric fields, 125 
Electron scattering, 457, 459 
grain boundary, 461 
Electron spectrometer, AES, XPS, 284 
Electron spectroscopy, 277 
Electronic packaging, 404 
Electronic stopping power, 611 
EUingham diagram, 24-25 
Ellipsometry, 258-260 
film thickness, 253 
Endothermic reaction, 173 
Energy, 
level, 14 
of adhesion, 439 
Energy band, diagrams, 19-20 
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Energy band gap, 514 

Energy dispersive x-ray (EDX) analysis, 269, 
282 

Energy gap, 20 
superconductivity, 483 
Energy levels, 665 
Energy loss, ions, 610-611 
Energy transfer function, 108 
Enthalpy, 22 

compound formation, 561 
Entrapment pump, 62 
Entropy, 22 
Epitaxial film, 
characterization, 343 
crystal defects, 320 
grain boundaries, 321 
growth, 339 
screw dislocations, 320 
stacking faults, 320 
superlattice, 349 
twins, 321 

x-ray diffraction, 347 
Epitaxy, 307-308, 310, 315, 316, 335 
Ag on NaCl, 207 
Au films, 222 
buried silicide, 623 
commensurate, 318 
compound semiconductor, 322 
critical film thickness, 317 
discommensurate, 318 
film growth mechanism, 342 
granular, 227 
II -VI compounds, 331 
interfacial crystallography, 314 
lateral growth, 333 
regrowth, 606 
relaxed, 309 
Si on SiOs* 590 
solid phase, 602-603 
strained layer, 309 
vapor levitation, 335 
vapor phase, 334 

Equilibrium conditions, Si-Cl-H system, 160 
Equilibrium constant, 23 
Equilibrium cooling, LPE, 332 
Erosion, 576-577 
rate, 577 

Error function (Erf), 35 
ESCA, 287, 615 
Etch rate, 634 
anisotropy, 635 
selectivity, 635 


Etching, 631, 634 
dry, 632 
plasma, 632 

Eutectics, ion beam mixing, 618 
Evaporated films, zone structure, 225 
Evaporated metal films, internal stress, 422 
Evaporation, 88, 102-103, 131, 132 
Al-Cu, 86 
alloys, 85 
compounds, 84-85 
early aplications, 79 
electron beam, 99 
film uniformity, 90 
geometry, 87 
hardware, 97 

materials characteristics, 102- 103 
magnetic film, 649 
power density, 99 
reactive, 96 
uniform deposition, 92 
Evaporation flux, 81 
Evaporation rate, 81 
cosine dependence, 88 
Evaporation source, 
refractory metal, 97 
tungsten, 97 
Exchange energy, 486 
Excitation, 108 
Exothermic reaction, 173 

F 

Fabry-Perot, 507, 540 
Faraday effect, 498 
Fatigue wear, 573 
Fe, 598 

Mossbauer spectra, 493 
Fermi energy, 20 
Fermi level, 21 
Ferromagnetism, 485 
Ferromagnets, 485 
amorphous, 492 
Fick’s law, gas diffusion, 165 
Field energy density, magnetic, 485 
Film coverage, 93 

Film deposition, superconductors, 642 
Film growth, 197 
diffusion limited, 397 
reaction limited, 397 
Film growth rate, CVD, 168 
Film purity, 95 
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Film structure, simulation, 239 
Film thickness, 252-264 
measurement, 253 
Film thickness uniformity, 89, 91 
CVD, 169-170 
Film transparency, 531 
Filters, 532 
Flank wear, 576 
Fluorescence, optical, 540 
Fluorescent x-rays, 283 
Forming tools, 548 
Four point probe, 454 
Fracture, 568-569 
coatings, 566 
probability, 567 

Fracture toughness 552, 568-569 
Free electrons, 16 
Free energy 23-24 
CVD, 156 
diamond. 637 
diffusion, 38 
metal oxides, 25 
nucleation, 41, 200 
Si-Cl-H system. 159 
Fresnel equations, 525 
Fretting wear, 573 
Friction, 570, 572 
Fuchs model, 459 

G 

Ga stabilized growth, 342 
Ga-As, phase diagram, 31 
GaAs, 327, 658-659 
adsorption-desorption, 341 
CVD, 152, 154 
diffusion in, 360 
ellipsometry, 260 
growth mechanism, 340 
CPE, 331 
MBE, 339 
RHEED, 347 
GaAs-AlGaAs, 668 
AES, 286 
ellipsometry, 260 
liquid phase epitaxy, 333 
superlattice, 666 
TEM dark Field image, 271 
GaAs-Fe, epitaxy, 315 
GalnAs, 350 
GalnAsP, 342 


GaP, 

CVD, 154 
properties, 325 
Garnet, 498 
Gas diffusion, 165 
Gas flow, 56 
Gas impingement rate, 53 
Gas transfer pump, 62 
Gas transport, 55, 162 
Gas velocities, 49-50 
GaSb, properties, 325 
Gases in films, 96 
GdCo, 235 
Ge, 

CVD, 151 
properties, 325 
Ge-Si, phase diagram, 28 
GeSe, 235 
GeSi-Si, 309 
epitaxy, 318 
superlattice, 664 
Gibbs Phase Rule, 26, 161 
Gibbs-Thompson equation, 215 
Glow discharge, 101 
dc, 104 

Glue layer, 373 
Graded base transistor, 670 
Grain boundaries, 10 
Grain boundary, 
diffusion, 358, 361, 374 
dislocation model, 12 
electron scattering, 461 
Kirkendall effect, 375 
stress, 431 
Grain growth, 227 
Grain size, 10 

Grain structure, simulation, 230 
Graphite, 636, 639 
Gray Filter, 538 
Griffith crack theory, 568 
Growth kinetics, CVD, 167, 171 
Guided mode, 656 

H 

Hagen-Poiseuille flow, 164 
Hall-Petch equation, 409, 564 
Hard carbon, 637, 649 
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Hard coatings, 547, 551, 565 
impurities, 565 
metastable, 565 
texture, 565 
Hard materials, 
properties, 552 
table, 552 
Hardness, 561, 575 
dependence on microstructure, 564 
diamond, 636 
grain size dependence, 564 
load dependence, 563 
table, 552 
testing, 562 
Hazardous gases, 188 
table, 189 

Heat conduction, 596 

Heat of condensation, sputtered metals, 116 
Heat transfer, 553 
Heteroepitaxy, 308, 330 
Heterogeneous nucleation, 43, 199 
Heterojunction, 323 
GaAs-AJAs, 329 
HfN, 556 

HgTe, properties, 325 
High T c superconductor, 641-642 
film deposition 642 
thermal stability 678 
Hillock, formation, 438 
Hillocks, 678 
Homoepitaxy, 307 
Homogeneous nucleation, 200 
Hooke's law, 16, 407, 418, 569 
Hot mirror, 537 
Hot wall reactor, 175 
Hydride process, CVD, 153 
Hydrogen, PECVD films, 183 
Hyperfme magnetic field, 492 
Hysteresis, 
magnetic, 487, 493 
single domain, 498 

i 

IAD, 137, 521 
films deposited by, 138 
ICB, 138-139 
cluster nucleation, 144 
films deposited by, 140 
In 2 0 3 :Sn, 515 

In 2 0 3 :Sn0 2 (IT0), transparent conductor, 517 


[nAs, 

CVD, 154 
properties, 325 
Index of absorption, 509 
Index of refraction, 509, 513 
Indirect energy band gap, 324, 326 
Induced dipole, 514 
Inelastic collisions, 108 
InGaAs, 

MBE, 339 
quantum well, 664 
InP, 330, 350 
CVD, 154 
MBE, 339 
properties, 325 
InSb, properties, 325 
Instantaneous stress, 423 
Insulating films, 465 
Integrated optics, 654-656 
devices, 657-658 
materials, 659 
rf spectrum analyzer, 660 
Interfaces, film-substrate, 440 
Interference, 528 
optical, 507 

Interference pattern, stress, 424 
Interferometry, 253 
FECO, 255 
film stress, 421 
Tolansky method, 254 
transparent films, 256 
Internal stress, 413, 416 
evaporated metal films, 422 
measurement techniques, 421 
sputtered films, 428 
Intrinsic conduction, 466, 469 
Intrinsic stress, 425 

metals, nonmetals, (table), 426 
recrystallization, 430 
temperature dependence, 427 
theories, 429 

thickness dependence, 426 
Inversion, 476 

Ion assisted deposition (LAD), see IAD 

Ion beam, 622 

Ion beam evaporation, 135 

Ion beam mixing, 617 

Ion beam modification, 623 

Ion beams, 610 

Ion bombardment, 622 

Ion channeling, 296 

Ion current, plasma, 106 
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Ion implantation. 111, 591,609-610, 613-615, 
617 

applications, 620 
Ion plating, 133-134, 557 
Ion-surface interactions, 109-110 
Ionic bonding, 17 
Ionic compounds, hard, 551 
Ionic conduction, 466, 469 
Ionization 108, 610 

Ionized cluster beam deposition (ICB), see ICB 
Island growth, 197, 200 

j 

Josephson, 673 

Josephson device, 674, 676-678 
Josephson junction, 674, 676 
Josephson tunneling, 485 
Junction spiking, 387 

K 

Kaufman source, 137 
Kinetic energy, sputtered metals, 116 
Kinetic theory of gases, 47 
Kinetics, 33, 368 
compound formation, 378 
oxidation, 397 
Si0 2 growth, 397 
Kirkendall effect 234, 369 
grain boundary, 375-376 
Knoop hardness, 562 
Knudsen cell, 88 
Knudsen number 56-57 

L 

Laminar flow 56, 163 

Laser, 323, 540, 589-590, 593, 595-596, 
598-600, 602, 607, 609, 650-651 
cw, 592 
heating, 597 
interaction time, 590 
power density, 590 
pulsed, 592 
Q-switchcd, 592 
sources , 59] 

Laser annealing, 595-596 
Laser assisted CVD 
photolytic, 185 
pyrolytic, 185 


User enhanced CVD (LECVD), 148, 185 

User modification, 602 

Laser processing, 590 

Laser properties, table, 593 

User scanning, 594 

User surface alloying (LSA), 607 

Uteral epitaxial growth, 334 

Uttice diffusion, 208, 358 

Uttice misfit, 315-316 

Uyer growth, 197, 200 

Leak rate, 74 

LECVD, see User enhanced CVD 
LEED, 250, 343-344 
Si, 345 

Left standing model, 621 
LEGO 333-334 
Lever rule, 29 
Light emitting diode, 324 
LiNb0 3 , 658-660 
Liquid phase epitaxy (LPE), 331 
reactor, 332 
LiTa0 3 , 659 
Lithography, 630 
Longitudinal recording, 646, 648 
Lorentz force, 123 
Lorentz microscopy, 496 
Lorenz-Lorentz equation, 514 
Low energy electron diffraction (LEED), see 
LEED 

Low pressure CVD (LPCVD), 148, 179 
reactor, 180 
silicon nitride, 182 
LSS theory, 611, 615 
Lubrication, 572, 578 

M 

Magnetic bubbles, 499 
size, 499 
speed, 500 
Magnetic disk, 645 
Magnetic domains 489, 494 
Magnetic film, 649 
Magnetic materials, 
hard, 487, 488 
soft, 487, 488 

Magnetic recording 488, 645-646, 648 
Magnetic tape, 645 
Magnetic thin films, 488 
memory applications, 493 
size effects, 489 
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Magnetization 485 
film thickness dependence, 490 
perpendicular, 498 
saturation, 488 
surface, 491 

temperature dependence, 490 
Magneto-optical recording, 650 
materials, 652 

Magnetometry, SQUIB, 675 
Magnetoresistance, 502, 647 
Magnetostatic energy, 487 
Magnetron sputtering, 115-116, 123, 125-126 
Mask, 630 
generation, 631 
Mass resolution, 299 
Matthiessen’s rule. 455-456 
Maxwell-Boltzmann distribution, 48 
MBE, 336-339 
superlattice, 663 
MCrAIY, 584 

Mean free path, gas molecules, 52-53 

Mechanical anchoring, 441 

Mechanical equilibrium, 416 

Mechanical interface, 442 

Mechanical properties, 403 

Mechanically functional surface, 618 

Meissner effect, 481 

Melt depth, 598 

Mesotaxy, 623 

Metal carbides, alloy, 558 

Metal compounds, hard, 551 

Metal silicides, 389 

Metal-semiconductor contact, 472 

Metallic bonding, 16 

Metallization, step coverage, 94 

Metallurgical coating, 547 

Metallurgical coatings, 550 

Metalorganic CVD (MOCVD), see MOCVD 

Metals, 

electrical conduction, 456 
optical properties, 510 
Metastable phase, 609 
MgF 2 , 513, 528, 535 
Microassembly, 634 
Microbalance, 263 
Microbeams, 412 
Microhardness, 562 
Micromachining, 633 
Microstructure, 

CVD coatings, 556 
PVD coatings, 557 
Microvoids, 233 


Miller Indices, 5-6 
Mirrors, 508, 510, 530 
Ag, Al, 513 
Misfit dislocations, 322 
Mo, CVD, 149 
MOCVD, 186-187 
superlattice, 663 
Modification 
composition, 613 

mechanically functional surfaces, 618 
of Si, 622 
structure, 610 

Modification of surfaces, 589 

Modulation doping, 667-668 

Molecular beam epitaxy (MBE), see MBE. 

Molecular field, 491 

Molecular flow, 56-57 

Molecular velocities, 48 

MOMBE, 338 

Monolayer gas coverage, 54 

Morphology, columnar, 228 

MOS capacitance, 476 

MOS structure, 475 

Mossbauer effect, 492-493 

Multilayer coatings, 559-560 

Multilayer optical films, 531 

Multilayers, 668 

Multiple beam interferometry, 253 

N 

Na vapor lamp, 536 
Nanoindenter, 412-413, 564 
Narrow band filter, 540 
NbN, 675 
Neel wall 494 
energy, 495 

Nemst-Einstein equation, 39, 356, 435 
electromigration, 383 
Neutral filter, 538 
Ni, 598, 601 
CVD, 149 

implanted in stainless steel, 615 
Ni-Si, 391 
Ni-Zr, 236 
TEM micrograph, 239 
Ni 2 Si, 391 
NiAl, 581 
NiSi, 391 
NiSi 2 , 391 
Nitrides, 551 
alloy, 558 



Index 


699 


Nitriding, 580-581 
Nonabsorbing film, 524, 526 
Nuclear stopping power, 611 
Nucleation, 40, 195, 200, 202 
atomistic theory, 206 
charge effects, 218 
contamination, 223 
deposition rate. 223 
effect of temperature, 223 
heterogeneous, 43, 199 
homogeneous, 41 
kinetic models, 209 
metals on alkali halides, 223 
noble metal-alkali halide, 212 
parameters, 213 
substrate, 223 
TEM, 196 

transition temperature, 208 
Nucleation and growth, 219 
AES, 221 

Nucleation rate, 42-43, 201, 203, 212 
Ag. 207 

deposition rate, 204 
one atom nucleus, 210 
substrate temperature, 204 
temperature dependence, 211 
time dependence, 211 

o 

Ohm’s law, 452 
One component system, 26 
Optical communications, 329 
Optical constants, 511 
Optical filter 535, 538 
broadband, 539 
edge, 539 
narrow band, 540 
UV, 539 

Optical properties, 507, 518 
dependence on deposition rate, 521 
dielectrics, 513, 517, 519 
environmental effects, 523 
metals, 510 
process variables, 522 
semiconductors, 515, 517 
Optical recording, 650-651, 653 
Optical waveguides, 655-656 
channel, 657 
embedded, 657 
thin film, 658 


Optoelectronics, 323 
Orbitals, 
sp, 636, 640 

Ostwald ripening, 214-215 
Outgassing, 74 
Oval defect, 321 
Oxidation, 396, 581-582 
CVD, J50 
kinetics, 397 
Oxidation rate, 582 
inverse logarithmic, 583 
logarithmic, 583 
Oxidation wear, 574 
Oxygen, content in films, 96 

p 

P 2 O s , CVD, 150 
Packing density, 518, 520 
Packing factor, 233, 263 
Partition coefficient, 603 
Passive compound barrier, 394 
Pb-In-Au, strain relaxation, 436 
PECVD, 148, 181 
materials deposited by (table), 183 
silicon nitride, 182 
Permalloy, 486-487, 494, 502, 647 
Permanent dipole, 514 
Perpendicular recording, 646, 648 
Phase change recording, 650, 653 
Phase diagram, 32-33 
analysis, 29 
binary, 29 
carbon, 27 

pressure-temperature, 27 
Phase diagrams, 26 
Photoresist, 631, 633 
Photothermal coatings, 537 
Physical vapor deposition (PVD), see PVD 
Phy si sorption, 340 
Pilling-Bedworth ratio, 583 
Plasma, 105 
energy, 115 
etching, 632 
ionization, 108 
ions, 106 
microwave, 184 
neutrals, 106 

Plasma energy, sputtered metals, 116 
Plasma enhanced CVD (PECVD), see PECVD 
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Plasma spraying, 585 
Plasma wavelength, 515 
Plasticity, 404 

Point source, evaporation, 87 
Point source, 
evaporation, 89 
film uniformity, 90 
Polarizability, 514 
Poly-Si, 632 
bulge test, 411 
CVD, 176 

Poole-Frenkel conduction, 466, 469, 472 
Porosity, 233 

Power dissipation, electron beam evaporation, 
101 

Precursor gases, MOCVD, 187 
Pressure, 50 
units, 51 
Printing, 631 
Profilometry, 261-262 
Projected range, 614 
Prosthesis, 619 
Protective coatings, 547 
Protective oxide, 583 
Pseudomorphic growth, 316 
Pi-Si, RBS, 290 
PtSi, RBS spectrum, 295 
Pumping, 55 
Pumping speed, 60 
Pumping time, 61, 74 
Purple plague, 376 
PVD, 80 

ball bearings, 579 
coatings, 550 
hard coating, 557 
reactor, 555 
Pyrolysis, 149 

Q 

Quantum devices, 669-670 
Quantum well, 664-665, 667, 671 
GaAs-AlGaAs, 666 
Quartz, 9 
Quench rate, 598 

heated surface, 599, 601 

R 

Radiation limited temperature, 117 
Raoult’s law, 86 


RBS, 111, 250, 275, 289-296 
accelerator, 292 
Al-Au, 377 
Au-Co, 617 
Au-Pd diffusion, 366 
beam line, 292 
elemental analysis, 293 
quantification, 294 
Si-Bi, 604 
spectrometer, 292 
Reaction rate control, 172 
Reactions, 355 
Al-Cu, 387-388 
metal-semiconductor, 385 
Reactive evaporation, 135 
Reactive ion plating (RIP), 135 
Reactive sputtering, 126 
deposition rate, 128 
hysteresis, 127 
stoichiometry, 129 
target purity, 128 
Reactor (CVD), 
barrel, 178 
pancake, 178 
tubular, 178 
Real growth model, 621 
Reciprocal lattice, 313 
Reconstruction, 310 
Recording, 
magnetic, 645 
optical, 650 
Recording head, 647 
Recording media, 647 
Recrystallization 
optica] recording, 653 
stress, 430 
Reduction, 149 

Reflectance, 256, 510, 527, 535 
Al, 512 

effect of deposition rate, 512 
In 2 0 3 :Sn, 516 
metals, 511 
multilayer, 534 
Reflection coefficient, 525 
Reflectivity, double layer, 533 
Refractive index, 518, 528, 530, 534, 541 
dependence on film structure, 519 
Ti0 2 , 523 

Refractory metal, evaporation source, 87 
Regrowth, epitaxial Si, 602-603 
Residual gas, effect on film purity, 95 
Residual resistivity ratio, 456 
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Residual stress, 414 
Resist, 630 
Resistivity, 453 
CoSi 2 , 462 
four point probe, 454 
metal film, 458, 459 
metals, 456 

Van der Pauw method, 454 
Resonant tunneling, 672 
Reversible transfer, 152 
Reynolds number, 163 
RF spectrum analyzer, 660 
RF sputtering, 121 
current-voltage, 122 
RHEED, 208, 343-344 
GaAs, 347 
Si, 345 

RHEED oscillations, 346 
Roots pump, 64 
Rotary piston pump, 62-63 
Rotary vane pump, 62 
RTCVD, 335 

Rutherford backscattering, see RBS 

s 


P~N junction, 21 
synthesized by MOCVD, 187 
Shadowing, 224 
Shear stress, 406 
Sheet resistance, 454-455, 516 
Si, 235, 600, 603 
bulge test, 410 
CVD, 149, 158 
CVD deposition kinetics, 173 
CVD growth rate, 172 
diffusion in, 360 
disproportionation, 152 
epitaxial regrowth, 530 
epitaxy, 179 
etch rate, 634 
etching, 632, 635 
LEED, RHEED, 345 
micromachining, 633 
properties, 325 
residual stress, 410 
STM, 312 

surface modification, 602 
thermodynamics of CVD, 158 
Si-Au, amorphous, 605 
Si-Bi, 604 
Si 3 N 4 , 151 
conduction, 470-471 
SiC, 161, 547 
CVD, 150 

Silicide, Kirkendall effect, 369 
Silicides, 389 
formation (table), 390 
ion beam mixing, 618 
Siiiciding, 580 
Silicon nitride, 181 
CVD, 182 
properties, 182 

Silicon on insulator, LEGO, 333 
Silicon on insulator (SOI), 310 
SIMOX, 623 

SIMS, 250, 275, 296-297 
spectrometer, 298 
static, 299 
Sintering, 214-215 
Au on molybdite, 216 
kinetics, 216 
surface diffusion, 217 
Si0 2 , 396, 513 
amorphous solid, 9 
conduction, 471 
crystalline, 9 
CVD, 150, 177 


Sacrificial barrier, 394 
SAW device, 660 

Scanning electron microscopy (SEM), see SEM 

Scanning tunneling microscope (STM), 311 

Scattering, diffuse, 458 

Schottky barrier, 473 

Schottky diode, 473 

Schottky emission, 466-467, 474 

Scratch test, 444 

Se, 235 

Secondary electrons, 267 
Secondary ion mass spectroscopy (SIMS), see 
SIMS 


Seed atoms, 622 
Seeman-Bohlin geometry, 274 
Segregation coefficient, 603 
SEM, 250, 265-266, 275 
backscattered electrons, 268 
nucleation, 219 
Semiconductor, 
energy gap, 21 
intrinsic, 21 
N type, 21 
P type, 21 
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Si0 2 ( Continued) 
density, 514 

dielectric breakdown, 479 
etch, 632 
etch rate, 634 
etching, 635 
free energy, 24 
growth kinetics, 397 
oxidation kinetics, 397-398 
stress, 432 
subsurface, 623 
Size effect, 452, 458 
diffusion, 370 
electrical resistivity, 459 
magnetic films, 489 
Smooth growth, 208 
Snell’s law, 655 
Solder, 372 
Solidification, 599 
surface film, 604 
Solidification rate, 601 
Solute trapping, 605 

Space charge limited conduction, 466, 469 
Spring-dashpot, 433 

Sputter gases, effect on sputter yield, 113 
Sputter yield, 111-112, 113 
metals, 113 
Sputter-ion pump, 70 
schematic, 71 

Sputtered films, stress, 427 
Sputtering, 109, 131 
alloys, 113 
bias, 129 
dc, 102 

early applications, 79 
hysteresis, 126 
magnetron, 116, 118, 125 
reactive, 118, 126 
RF, 103, 118, 121 
stoichiometry, 114 
table, 132 
targets, 118 

targets (table), 119-120 
YBa 2 Cu 3 0 7 , 644 
zone structure, 225 
SQUID, 485, 673-674 
dc, 675 

magnetometry, 675 
RF, 675 

Static SIMS, 299 
Step cooling, LPE, 332 
Step coverage, 94 


Step gauge, 253, 258 

Stoichiometry, reactive sputtering, 129 

Stoney equation, 416 

Stopping power, 113 

Straggle, 614 

Strain, 406 

Strain relaxation, 433, 435 
superconductor film, 437 
Strained layer, superlattice, 664 
Strained layer epitaxy, 318 
Stranski-Krastanov growth, 197 
Stress, 405 

Stress relaxation, 432, 438 
Stress-strain, Au film, 408 
Structural characterization, 265-275 
Stuffed barrier, 393 
Stylus, film thickness, 253 
Sublimation, 83 
heat of, 99 

Sublimation furnace, 98 
Substrate, 

temperature-time, 117 
thermal history, 115-116 
Substrate temperature, nucleation, 203 
Sulfidation, 583 
Superconducting materials, 481 
type I, 482 
type II, 482 

Superconductivity, 480, 482, 641 
tunneling, 483 
Superconductor, 641 

Superconductor film, thermal cycling, 437 
Superlattice, 661-662 
compositional, 663 
doping, 663 
strained layer, 664 
table of systems, 663 
x-ray diffraction, 349 
Superlattices, 668 
Supersaturation, 41 
Surface 
contacts, 571 
cooling, 598 
electron scattering, 457 
heating, 598 

Surface alloying, 607, 609 
Surface analysis, 276 
applications, 301 
Surface crystallography, 312 
Surface diffusion, 202, 208, 224, 361 
Surface modification, 589, 593, 602 
Surface morphology, 621 
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Surface reconstruction, 344 
Surface source , 
evaporation, 87, 89 
film uniformity. 90 
Surface structure, 621 
Surgical implant, 619 


T 


Ta, 

dc sputtering, 130 
RF sputtering, 130 
TaN, resistivity, 129 
Targets, 
boride, 119 
carbide, 119 
chalcogenide, 120 
fluoride, 119 
metal, 119 
nitride, 120 
oxide, 119 
silicide, 120 
Taylor equation, 586 
TEM, 250, 269-270 
amorphous films, 238 
Au-Co, 237 
cross section, 273 
dark field, 271 
film cross section, 229 
nucleation, 196, 219 
Tensile properties, metal films, 407 
Tensile strength, 409 
Tensile stress, 405 
Tetrahedral bond, 6, 9 
Thermal barrier coating, 584 
Thermal coatings, 547, 584 
Thermal conductivity, 
diamond, 636 
table, 552 

Thermal cycling, 437 
Thermal diffusion, 596 
Thermal expansion, table, 552 
Thermal history, sputtering, 115 
Thermal shock, diamond, 636 
Thermal shock parameter, 553 
Thermal spike, 611 
Thermal stress, 419, 427, 553 
Thermodynamics, 21 
chemical, 23 
CVD, 155 
nucleation, 198 
Si-Cl-H system, 159-160 


Thick film resistors, 463 
Thin film, resistors, 463 
Thin film model, 239 
perfect structure, 240 
stacking fault, 240 
Thin film optics, 524 
Thomson model, 458 
Throughput, 57 
Ti, wear, 619 
Ti sublimation pump, 70 
Ti-W, 118 
TiAI 3 , 395 

TiC, 547, 554, 556, 559-560, 576, 578-579 
CVD, 150 
thermal stress, 419 
TiCN, 556, 560 

TiN, 547, 554, 556-557, 559-560, 563, 
575-576, 578 
Ti0 2 , 502, 547 

refractive index, 523 
TiSi 2 , 395 
Titanium, 

electron excitation processes, 280 
electron spectroscopy, 280 
x-ray spectroscopy, 280 
Tool wear, 574 
Tools, 620 
Toughness, 569 
Townsend equation, 142 
Transistor, 
bipolar, 386 
graded base, 670-671 
MOS. 386 
tunneling, 672-673 
Transition metal compounds, 551 
Transmission, optical system, 533 
Transmission coefficient, 525 
Transmission electron microscopy (TEM), see 
TEM 

Transmittance, 526-527 
Transmittance range, 518 
Transparent conductors, 516 
Treshold energy, sputtering, 112 
Tribology, 570, 619 
Triode, 633 

Tungsten, evaporation source, 97 
Tunneling, 466, 468, 471, 474, 673 
Josephson, 485, 676 
resonant, 672 

superconductivity, 483-484 
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Tunneling devices, 672 
Turbomolecular pump, 66, 68 
schematic, 67 
Turbulent flow, 56 
Two component system, 27 

u 

Uniaxial anisotropy, 486 

v 

Vacancy, 234 
concentration, 13 
formation energy, 13 
Vacuum, leak rate, 74 
Vacuum deposition system, 72-73 
Vacuum pumps, 62 
Vacuum science, 47 
Valence band offset, 665 
VAMFO, 253, 257 
Van der Pauw method, 454-455 
Van der Waals, 
adhesion, 442 
bonding, 19 
bonds in graphite. 636 
solid, l 

Vapor deposition, hard coatings, 554 
Vapor pressure, 82 
elements, 83 

semiconductor elements, 84 
Vapor transport, CVD, 155 
Vickers hardness, 562 
Viscosity, 164 

Viscous How, 56-57, 162, 164 
stress relaxation, 433 

w 

W, CVD, 149 
Wear, 570, 573, 619-620 
tool, 574-575 
Weibull distribution, 567 
Whisker, 622 
Work Function, 20 


x 

X-ray diffraction, 273, 347 
film stress, 421, 425 
GalnAs-InP, 350 
rocking curve, 348 
superlattice, 349 

X ray photoelectron spectroscopy (XPS), 250, 
275 

X-ray spectroscopy, 278 
quantification, 283 
Ti, 279 

XPS, 282, 286-287 
angle resolved, 288 
GaAs, 288 

Y 

Y 2 0 3 , CVD. 156-157 
Y 3 Fe 5 0 12 , 498, 500 
YBa 2 Cu 3 0 7 , 643 
film preparation, 644 
in superconductors, 641 
resistance-temperature, 643 
structure, 642 
Young's equation, 199 
Young’s modulus, 407 
table, 552 

z 

ZnS, 331, 513, 521, 535 
microstructure, 520 
properties, 325 
ZnSe, 331, 513 
properties, 325 
ZnSe-GaAs, 348 
ZnTe, 513 
properties, 325 
Zone structure, 224-225, 227 
evaporated films, 226 
Zr0 2 , 547, 584 
Y stabilised, 584 
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